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Introductory. 

I understand by a factor of evolution any source of progressive change in the 
constants — mean values, variabilities, correlations—which suffice to define an organ 
or character, or the interrelations of a group of organs or characters, at any stage in 
any form of life. To demonstrate the existence of such a factor we require to show 
more than the plausibility of its effectiveness, we need that a numerical measure of 
the changes in the organic constants shall be obtained from actual statistical data. 
These data must be of sufficient extent to render the numerical determinations large 
as compared with their probable errors. 

In a " Note on Reproductive Selection," published in the ' Roy. Soc. Proc./ vol. 59, 
p. 301, I have pointed out that if fertility be inherited or if it be correlated with 
any inherited character— those who are thoroughly conversant with the theory of 
correlation will recognise that these two things are not the same — then we have a 
source of progressive change, a vera causa of evolution. I then termed this factor 
of evolution Reproductive Selection. As the term has been objected to, I have 
adopted Genetic Selection as an alternative. 1 mean by this term the influence of 
different grades of reproductivity in producing change in the predominant type. 

If there be two organs A and B both correlated with fertility, but not necessarily 
correlated with each other, * then genetic or reproductive selection may ultimately 
cause the predominance in the population of two groups, in which the organs 
A and B are widely different from their primitive types—' widely different/ because 
reproductive selection is a source of progressive change. Thus this form of selection 
can be a source, not only of change, but of differential change. As this differentia- 
tion is progressive, it may amount in time to that degree of divergence at which 
crossing between the two groups begins to be difficult or distasteful. We then 
reach in genetic or reproductive selection a source of the origin of species. 

When I assert that genetic (reproductive) selection is a factor of evolution, I do 
not intend at present to dogmatise as to the amount it is playing or has played in 
evolution. I intend to isolate it so far as possible from all other factors, and then 
measure its intensity numerically. If this be sensible, then the demonstration that 
it is a factor is complete. How far it may be held in check by other factors— 
e.g., natural or sexual selection — is a matter for further inquiry. If three forces, 
F 1? F 2 , F 3 hold a system sensibly in equilibrium, then F x cannot be asserted to be 
non-effective because no progressive change is visible ; its absence would soon bring 
to light its effectiveness. 

The manner in which genetic (reproductive) selection is to some extent held in 
check will be clearer when my memoir on the influence of directed selection on 

* If r ab be the correlation of two organic characters A and B, and be a. third character, there is a 
considerable range of values of r ac and r bc , for which r ab may be zero (see Yule, c Boy. Soc. Proc.' 
yoI. 60, p. 486), 
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variation and correlation is published. Meanwhile Mr. Filon and I have shown 
that even a random selection of one organ alters the whole system of correlated 
organs. # Hence genetic (reproductive) selection indirectly modifies not only organs 
A and B, but all correlated organs. These modifications must be consistent with 
the maintenance of stamina, physique and fitness to the environment, if the change 
is not to be counteracted by natural selection. 

So far as man is concerned, I have shownt that in the case of civilised man, the 
selective death-rate- — i.e., natural selection — does not appear to counteract repro- 
ductive selection. A. small element of the population produces the larger part of the 
following generation. I thus concluded that if fertility were inherited, reproductive 
selection was not only a factor of evolution, but in civilised man a very sensible 
factor, i.e., an apparently incompletely balanced factor. 

In the three years which have intervened since writing the essay just referred to, 
members of the Department of Applied Mathematics in University College, as well 
as other friends, have occupied their spare time in the collection of data as to fertility 
and fecundity in the cases of man and of the thoroughbred racehorse. About 
16,000 extracts were made in the case of man, and more than 7000 in the case of 
thoroughbred racehorses. In the course of the work, which proved far more laborious 
than we had anticipated, many difficulties and pitfalls appeared. But as a general 
conclusion it seems certain that : Both fertility and fecundity are inherited, and 
probably in the manner prescribed by the Law of Ancestral Heredity, .\ 

The object of this memoir is to set forth the theory and data by aid of which 
this conclusion was reached. It will be seen that It completes the establishment 
of genetic or reproductive selection as a factor of evolution by determining the much 
disputed point as to whether fertility is or is not inherited. 

I. Theory of Genetic or Reproductive Selection. By Karl Pearson*, F.RS. 

(1.) While the physical result of fertility in an individual is measurable, the 
quality of fertility or fecundity in an individual differs from other physical characters 
in that it does not allow of direct measurements except when the potentiality is 
exerted and the effects recorded. At present we are not able to measure any series 
of organs or characters in individuals and so ascertain their fertility or fecundity. 
At the same time there is little doubt that these characters are functions of the 
physical and measurable organs and characters of the body. Such organs and 
characters we have good ground for supposing to be inherited according to the Law 

# " Contributions to the Theory of Evolution.— IV. On the Influence of Random Selection on Variation 
and Correlation," 'Phil. Trans.,' A, vol. 191, p. 234 et seq. 

f " The Chances of Death and other Studies in Evolution. Reproductive Selection," vol, 1, p. 63, 
t See ' Roy. Soc. Proc.,' vol. 62, p. 386, 

H2 
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of Ancestral Heredity. It seems therefore worth while to prove the following 
proposition : 

Proposition L — Any character not itself directly measurable, hut a function of 
physically measurable characters and organs inherited according to the Laiv of 
A.ncestral Heredity, will itself he inherited according to that law. 

Thus if we assume intellectual and emotional characters to be ultimately a result 
of physical conformation, we may he fairly certain that although we know neither 
the organs of which they are a function, nor the nature of that function, still they 
will be inherited according to the same law as that which holds for physically 
measurable organs. 

Leo y be the character in a parent, and let it be an unknown function f of the 
unknown physical organs x u x 2i x S} . . . x m , or let : 



Let Ay denote the deviation from the mean value of the character y in some special 
individual, and Ace the deviation from the mean of any x organ in the same individual. 
Then if these deviations be small compared with the mean values of the organs 
considered, we have from (i.) above : 

Ay = a Y Ax x + a 2 Ax 2 + a s Ax d J r , . . . . (ii,), 

where a l9 a 2 . . . are constants independent of the individual variations. 

Let or denote a standard deviation, p a coefficient of interorganic correlation, S a 
summation with regard to all individuals with character y dealt with, and let them 
be n in number. Then : 

no* = S (Ay) 2 = S (cbi Ax l + a 2 Ax 2 + a z Ax 3 + • • °) 2 

= n (aj'cj^ + a\al % + a^\ + . . . + 2a l a 2 <T Xl (r X2 p XlX2 + 2a 1 a s <r Xl (r Xs p XlX3 + ...); 

or 

= t(d\(il h ) + 2% (a l a 2 (T Xi a X2 p XlX ) ...... (hi.), 



.2 
?/ 



where 2 denotes a summation through the group of m organs. 

Let y denote the character in an individual who is the offspring of the individual 
of character y, and x' l9 x' 2i x' z . . . the corresponding organs. Then, if we do not 
suppose the nature of the function / to have changed in a single generation, we 
have : 

y =: J {X i 9 X 2> W 3 • ' • & m) i 

and 

Ay' z=z a,i Ax\ + a 2 Ax f 2 + a 3 Ax\ + . . . . . . . . (iv.), 

0*, — t (a 2 ! crl) + 2S(a 1 a 2 av 1 oy 2 /x, v J ...... (v.), 

Let r be a coefficient of direct heredity expressing the correlation between parent 
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and offspring, and according to the Law of Ancestral Heredity the same for all 
organs. Then multiplying (ii.) and (iv.) together and summing we have : 

ncTyO-yi'R = S (&y&y f ) = % (Vf S (^\x l Ax\)) + S (a^S (A^A^ + Ax 2 &x\)) 9 

where R is the coefficient of correlation between the characters y and y in parent and 
offspring. Now : 

S(Aa3 1 Ao3 , J ) = nar Xi <j x ,r 

S(Ax* 1 A t x / 2 + AajgAaj'i) = n<r Xi (r xl r XiX , % + ftcr^cr^r^, 

where r,«, andr y - are what I have elsewhere termed coefficients of cross- heredity. 
Now if the race be stable or sensibly stable for two generations we shall have for all 
organs ov = or x . Hence : 

S (AXiAx'i) = wcr^ X r 

S (A.^ Ax 2 + Aa; 2 Aa; , 1 ) = ncr Xi <r X9 (r x ^ + ?y ia J = ncr^cr^ X 2r/> w 

for it is shown in my memoir on the Law of Ancestral Heredity 4 '" that on a probable 
hypothesis : 

Thus we find on substitution : 

or y ay E = r (S (a\ a*) + 25 (a x a 2 <r Xl a^ /vJ). 

But (iii.) and (iv.) show us that o~ y = ay, if there be no sensible changes in a 
generation. Hence : 

o- y(Ty , — (S(a?cr^) + 2S (^la^^^p^)), 

and 

R = n 

Thus the character which is a function of physical organs is inherited at the same 
rate as those organs themselves. 

As we may not unreasonably consider fertility and fecundity to be functions of 
physically measurable organs, even if we cannot specify which organs, we may, 
a priori, expect fertility and fecundity to be inherited characters. 

(2.) Proposition II. — To determine the numerical values of the changes in mean 
variation and correlation if fertility be inherited. 

Let us first define two terms which will be frequently used in the sequel. 

(a.) The fertility of an individual shall be defined as the total number of actual 
offspring. 

* ' Roy. Soc. Proc.,' vol. 62, p. 411. The hypothesis yet awaits an experimental verification. The 
need to use it prevents Proposition 1. being self-evident. 
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(&.) The fecundity of an individual shall be defined as the ratio of the total number 
of actual offspring to the total number of offspring which might have come into 
existence under the circumstances. 

These definitions are not intended to give precise statistical measures at this stage 
of our investigations. They are merely meant to convey a general sense of the 
words, which will be more precisely limited when they are applied to any given species. 
Fertility and fecundity, as we have thus defined them, leave out of account individual 
conditions and definite conditions of period, age and environment, which must be 
fully stated before numerical measures can be made in any special case. When the 
words are used in this theoretical section the reader must suppose the phrase, " under 
definite individual and environmental conditions/' to be always inserted. 

Let Mi = the mean fertility of parents of one sex ; M^ = the mean fertility of 
parents of one sex weighted with their fertility ;* N x the number of parents con- 
sidered in the first case, N'j the apparent number dealt with in the second case ; 
let ci and rri be the standard deviations in the two cases, and let x represent the 
fertility of an individual parent and z its frequency among N\ parents. Let S 
denote summation for N x parents. Then, without any assumption as to the type of 
frequency, Ni = S (Xxz) = XM/Nj, where X is a constant such that Xx is the weight 
of a parent of fertility x. This follows at once, since : 

N x = S (z) 9 M, = S (xz)/S (z). 
Further. 

— S {(^-~ M i) 2 z + 2Mi (x z) - Mja} 

~~ -MxNx " """ ' 

— jjjgf ± 2M i N 'i Z M ' y i 

by the definition of standard-deviation. Hence, finally : 



9. 

or 



Further : 



'2 



S {Xx(x -M\ fz} __ S {Q - Mi + Mi) (x-M 1 + M, - M'^'g} 



Hence, multiplying out, we find after some reductions : 

of \ . S {<> - Mi) s z] 



72 



Cri "~ C7l \ i Mf/" 1 " MjN, " e ' * ' * * l W ' 

At first sight it might seem a comparatively easy matter to avoid weighting parents 
with their fertility, but practically it is almost impossible. For example, if records 

* i t e., if / be the fertility of a parent, each parent is repeated \f times, where X is a constant, 
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are sought of the fertility of mothers in mankind, the women will appear under their 
husbands' names, and the labour of ascertaining whether two sisters have been 
included is enormous, when large numbers are dealt with. But if two or more sisters 
have been included, their mother has been weighted with her fertility, and when we 
seek the correlation between mother and daughter, it will be between mothers and 
daughters when weighted with fertility. Bat a still more serious difficulty arises 
from the fact that all records are themselves weighted records ; the same number are 
not married from each family, hence we are more likely to find a member of* a large 
family included than a member of a small. The large families, when we seek a record of 
two generations, are more likely to appear than small families. Precisely the same diffi- 
culty occurs when we are dealing with thoroughbred horses ; a mare with large 
fertility is less likely to have all her offspring colts, or all her progeny sold abroad, 
some one or more will probably ultimately come to the stud, and thus mares of large 
fertility are, d priori, more likely to contribute to our fecundity correlation cards. 
We do not get over this difficulty by taking the mother and only one of her offspring. 
The record is still weighted with fertility. The practical verification of this lies 
in the experience that the fertility of mothers will always be found to be greater 
than that of daughters, although the fertility of the community may really be 
increasing ; the weighting, of course, excludes sterility in the generation of mothers, 
but the mere exclusion of the sterile is far from accounting for the whole difference. 

What we actually find from our records are M' x and a\, but what we want for the 
problem of heredity are M x and cr 1 . Equations (i.) and (ii.) do not suffice to determine 
these, because we cannot evaluate the third moment S {(x — Mifz}. We can hardly, 
even for a first approximation, assume it zero, for the standard-deviation, and there- 
fore the individual variation is large as compared with the mean in the case of 
fertility, i.e., the distribution is markedly skew. 

Turning to offspring of the same sex as the parents, say : let M 2 be the mean 
fertility of offspring taking one only to one parent for the number Ni of parents, 
supposing the parents not weighted with their fertility ; let M' 2 be the mean in the 
same case when the parents are weighted with their fertility ; and let M" 2 be the 
mean of all recorded offspring of the second generation. Let cr 2 , a"' 2 , <r" 2 be the 
standard deviations in the fertility of the offspring for the same three cases, and 
t, r\ r" be the corresponding coefficients of correlation between fertility in parent 
and in offspring. It seems to me that r is the coefficient which actually measures 
the real inheritance of fertility, but that in any correlation table that we can form 
we shall get r or r" . 

Let y be the fertility of any individual among the offspring, and x the fertility of 
the corresponding parent ; let \x as before be the weighting of the parent, and \ f x 
the number of offspring included in the record, X' being supposed a constant. # 

* I have been unable so far to find any sensible correlation between size of family and number 
married in man, but the point is worth a more elaborate investigation. 
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We have at once the following" results for the total numbers dealt with in each 

N t = S (z), W, = S (Xxz) = XM^,, 

W 1 = S(\x\'xz) = \\'S(x i z)=:\\'(ai + MT)13 1 .... (iii.). 



Turning to the means : 



M 2 = S (yzyNi (iv.). 



M' 2 = S (\xyz)[8\ = [S { (x - M,) (y - M 2 ) z] + M^S (^j/M^ = M 2 + r ^ (v.). 

XtXi 

M" 2 - S (Xx\'xyz)/W\ =: M 2 + ^g + ^^^ 

— M -i- . °^-- 2 a. • ^ M i z gi _j s Kg ~ M i) 2 (y ~ Mj) ^ 

_ ivi 2 -j- r M ^- -t- Mi ^ + M , i- ^ ^ + M ^ 

after some reductions. Now make use of (ii.) and we have : 



S {(x - M,)* ( ( ?/ - M 2 ) - r -- a (x - Mj ) \z) 
M", - M v 4- r -^ 4 r^-^- 4- - V - 1 ' - (vi ^ 

But for normal correlation the equation to the straight line of regression is : 

y — M 2 = r - J - (x ~ MA 

Hence for such correlation the mean value of y — Mo for parents x — M x is equal 

to r— (#: — M x ) and the summation term would vanish. For skew^ correlation, 

Mr. Yule has shown that the line just given is the line of closest fit to the curve ol 
regression, Hence even in the case of fertility, where the correlation is certainly 
skew, the summation term must be extremely small, or even zero. It follows, there- 
fore, that we may write : 



f" — M -i- v g " lg " 2 / 1 L --U°l \ (xr\\\ 

2 ~ 2l " M 1 \ "'" 1 + o\jU\) •••••• \ vl1 -)- 



If 
1? 



There is still another mean which ought to be found, namely? that of parents, 
when all their recorded offspring have been entered on the correlation, table. We 

have * 

M" x = S (XxX'xxzj/W, = S ^/{N, (a-'i + Mf)}, 

or, after some reductions : 

9 / /*> / 9 \ 



", = M, 4- — ( 1 4- — fli/^i — ) / v iii \ 

1 l ^ M,. \ 1 4 fff/Mf/ •••••• t. vul 4- 
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I now proceed to the standard deviations for the three cases, and the additional 
case (cr"i) for parents. 



~-2 
^2 



~/2 
(T a 



= S{(t/-M 2 ) 2 z}/N', (ix.). 



S{x(y- M 2 + M 2 - M' 2 ) a z3/M,N, 

S{x(y - M 2 ) 2 *} + 28{x(y - M 2 >} (M 2 



M' 2 ) + M 1 (M 2 -M' 2 ) 2 N 1 



M^ 



Whence, after some reductions, we find : 



'2 

<r 2 



= of 1 + r 



.2/ °1 



1 



s 



*>*■*•«■» 



at 



\ 



(x - MO ( (^/ ~M 2 ) 2 - r 2 ~ <> ~- M x ) 2 J z 



MiN] 



. (x.), 



Now for a nearly straight line of regression : 



0*2 

0*1 



y — M 2 = r ~ 2 (x — Mi) + t? 



where 17 is uncorrelated with x — M x . It follows accordingly that S{(x — M^tjz] and 
S{(x — M.i)r) 2 z] will both vanish, since S (??) for an array and S (x — M 2 ) for the 
whole correlation surface will be zero. Hence the summation term in (x.) is either 
absolutely zero or extremely small. We have accordingly : 



'2 

cr 2 



o-l 1 + r< 



/ '2 
/<""l 



t . 



. . • I Al> l« 



Before we proceed to determine <x" 2 and <r" 1 it seems simplest to find the coefficients 
of correlation r x r and r". We have : 



r = S{(» - 'My (y - M 2 )^}/(N 1 cr 1 o- 2 ) ..... (xii.). 



To find r we have : 



r = S{Xx2.(a? - M\) (2/ - M / 2 )}/(N» / 2 ). 



Now 



y-M 5 



ro-o 



cr. 



(a, - M,) + 77, 



where 77 is sensibly un-correlated with x — Mj. Hence : 



Wicr\cr f 2 v = S 



<r 2 

o"i 



Xxz (x - M' 2 ) (r ~ 2 (a? - M x ) + M 2 - M' a + i? 



Expanding, the summations with 77 vanish, and 
vol. cxcii. — A. 2 M 
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M,-M' 






TOT* / or 



CTj \M } 



p + M x ) by (v.) 



/Vl 



<r 2 



a\ 



M'i by (i.) 



But 
thus : 



cr'?=S{Xa!2(a!-M / 1 ) 2 }/N / 



i? 



v \ fcr 2 






or-. 



ra, y 



S{\xz(x — M^) 2 } 



or, 



2 "vjv /«> 



Thus we deduce : 



or : 



0"l cte 



rcrdcr'i =■ 'rcr 2 /cr ] 



* « 9 t -A- I k J.» I» 



This result has the simple interpretation that while the coefficient of correlation is 
changed, the coefficient of regression is unchanged by weighting fertility \ or by 
reproductive selection. 

This important conclusion is only an illustration of a very interesting theorem, 
which has been referred to in another memoir # and will be proved generally in a 
memoir on directed selection, written but not yet published, i,e n that in a wide range 
of cases selection, whether random or directed (natural and artificial) changes 
correlation but not regression. 

Before proceeding further a general remark will enable us to considerably simplify 
the otherwise lengthy algebra. Namely, the relation of M'^, M" 2 , o-" u a /f 2 , v" to M' x , 
M' 2 , cr'i, cr' 2i r is precisely the same as that of M' 1? M' 2 , <r' u <r' 2 , r themselves to M. u 
M 2 , cr 1? cr 2 , r. Consequently an interchange of symbols in results already found will 
lead us to the remaining formulae needful. 

As an illustration of this, let us verify the result we have found for M" 2 . By an 
interchange in (v.) : 



M.'\ = M' 2 + r 



f o* icr 2 



M 



/ ? 



hence using (v.), (i.) and (xiii.), we find : 



M", 



X 



+ 



rcr, 0\r 



l w 2 



M x 



+ T(T 2 



or 



h). 



ar% 



V\ ■*■*■*- 1 ~~r~ -. ,- 



M-2 "4~ 



TO-^CF.2 { 



xVJLi 



1 + 



o J \i<j\ 



1 + o-f/M 



* " Contributions to the Theory of Evolution.— -V. On the Reconstruction of Stature," ' Phil. Trans., 5 
A, vol. 192, p. 177. 
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exactly the result reached by a longer process in (vii.). Similarly (viii.) may be 
deduced from (i.). Applying this to find r" we have from (xiii.) : 

r"o-" 2 /cr"i = t v >i\<y u and therefore = rcr 2 /cri .... (xiv.), 

a result which again extends the constancy of the regression coefficient under the 
action of reproductive selection. 



Next from (xi.) : 



n 2 '2 J 1 I „J2 / 1 t 



O'l 



a 



<7i /I \ a 



or using (xiii.) and rearranging : 



o-" 8 



<r"l = cr\\ 1 + r 2 l —p — 1 ) V . . (xv.). 



o-i 



Again by interchanges in (ii.) : 

Here z stands for Xxz, and we should obtain a fourth moment of the original 
system of unweighted parents by substitution. But it is practically impossible to 
obtain a correlation table for such a system. Thus it is better to allow the sum- 
mation term to stand as it is, where it represents the third moment of a system 
of parents, weighted for fertility owing to the nature of the record, but not weighted 
with all their recorded offspring, (xvi.) is then a relation between the standard- 
deviations of parents weighted solely by forming a record and weighted both by this 
and by their offspring. 

Equations (i.) to (xvi.) contain the chief theoretical relations of our subject, # and 
I shall consider some points with regard to them in the following section. 

(3.) (a.) If we wish to ascertain whether fertility is inherited, we have to discover 
whether r is or is not zero. Now by (xiv.) r vanishes with both r and r\ and accord- 
ingly either of these will suffice to answer the problem. Still better, we may ascertain 
the coefficient of regression, and then whether our statistics weight for progeny or 
not we shall obtain the same value. If there be no secular change taking place in the 
population, due to something else than reproductive selection, we should expect, 
provided the Law of Ancestral Heredity holds for fertility, that the regression will 
be near '3 for parent and offspring.! 

# Two of these formulas, (v.) and (xi.), were given, but in a less precisely defined manner, in my 
" Note on Reproductive Selection " of 1896, 'Roy. Soc. Proc.,' vol. 59, p. 303. 
t See "Law of Ancestral Heredity," ' Roy. Soc. Proc.,' vol. G2, p. 397. 

2 M 2 
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(h.) If no reproductive selection exists, i.e., if fertility be not inherited, then r = 0, 

and 

cr" 2 = a 2 = cr 2 , M 2 = M/ 2 = M" 2 , 

or, however we form a record of offspring, the mean value and variability of their 
fertility ought not to be changed. We shall see later that this is very far from the 
truth, and that these values are in whole or part sensibly affected by the manner in 
which the record is formed. 

(c.) Although there be no reproductive selection, M 1} M^, and M"x will not all be 
equal, it is impossible that they should be. Further. cr u or\ and cr'\ need not be 
equal ; their degree of sensible divergence will depend on the nature of the primitive 
frequency distribution for parents. 

(d.) If fertility be inherited, or reproductive selection be an actual factor of 
evolution, then we see, by comparing (v.) with (i.) and (vii.) with (viii.), that the 
mean fertility of mothers will always be apparently greater than the mean fertility 
of daughters. This follows, since r is always less than unity, and if the race be not 
subjected to secular evolution, other than that due to reproductive selection, a 2 
cannot differ very widely from av* 

(e.) An argument from means, as to whether fertility is inherited or not, is very 
likely to be misleading. We may choose two groups from the record for comparison, 
neglecting the fact that their frequency in the record is not necessarily that of their 
frequency in the general population. Thus, if one person, say, in four were married, 
a marriage record of the community might exhibit the proper frequency of families 
of four, but it would not do so of families of one. The sort of fallacious arguments 
we have to be prepared for are, for example : 

(i.) That the fertility of the community is diminishing, because M' 2 is less 

than M'i. 
(ii.) That the fertility of the community is increasing, because M' 2 might be > Mj 

or M" 2 be > M\. 
(iii.) That fertility is not inherited, because, owing to natural selection, or other 

factor of evolution, one or other of these means for offspring is sensibly equal 

to one or other of these means for parents. 

Owing to the extreme difficulty of insuring that the method of extracting the 
record really gives us definitely M' 2 , say, and not M" 2 (or M" 2 in part), I have 
discarded all use of the mean values in attempting to ascertain whether fertility is 
inherited. The following result, however, is tempting, and might possibly be made 

* A difference between a Y and <r 2 would mark natural selection, sexual selection, or some other factor 
of secular evolution at work ; of secular, not periodic, evolution, as parents and offspring must have 
reached the same adult stage to have had their fertility measured; 
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use of in direct experiments on breeding insects, where a record could be kept ad 
hoc. It follows at once from (i.), (v.), (vii.) and (viii.) : 



JYJL 9 ~~~ -LV_Lo JAlL o *"~" J.VLL 9 Co >y"» • r» * / • « \ 

- — _ — _^ z=z r __ _ coenicient ot regression . . (xvii.j. 

It is the second ratio which, I think, might with profit be experimentally 
evaluated. 

(f) Since the mean fertility of daughters loaded with the fertility of their 
mothers is the fertility of the next generation, and we see that this is always greater 
than M 2 . if r be not zero, it follows that the inheritance of fertility marks a progres- 
sive change. The only means of counteracting its influence would be the reduction 
of M 2 to or below M x by the action of other equally potent factors of evolution. For 
the existence of such factors in man I shall later give evidence. 

(4.) Proposition III. — -To extend the results obtained for fertility to the problem of 
fecundity. 

While the fecundity of an individual can often, at any rate approximately, be 
measured, the fertility is not ascertainable. Thus we can ascertain the number of 
occasions on which a brood mare has gone to the stallion and the number of foals she 
has produced, but her fertility, the produce she might have had, if she had 
throughout her whole career had every facility for breeding, is unknown to us. But 
if we proceed to form tables for the inheritance of fecundity, we are met by precisely 
the same difficulties as in the case of fertility. The more fertile individuals are & 
priori more likely to appear in the record, and will be likely to be weighted again 
with their fertility when we come to deal with their offspring. * 

Now it is certain that fertility must be correlated with fecundity ; or, if x now 

represents the fecundity and /the fertility, we shall have for the mean fertility for a 

given fecundity x an expression of the form X + Kx, always supposing the regression 

to be sensibly linear. But the fertility must vanish with the fecundity, hence X = 0, 

and Xx is really the ratio of mean fertility to mean fecundity. Thus we may w T rite for 

the fertility f 

f— XiOj + £, 

where £ may vary widely, but it is not correlated with x. 

If now all the symbols we have used with regard to fertility in Section (2) be inter- 
preted as referring to fecundity, we must weight with, a factor \f instead of a factor 
Xxc, or with a factor XX r x + X£. So long as this factor is linear, absolutely no change 
can be made in the results, for, £ being uncorrelated with x 9 all summations including 

* In the case of sires especially, if we are dealing with thoroughbred horses, their comparative 
fewness at each period renders it qnite impossible to deal with one offspring of each parent only. 
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S (£) vanish. Thus all the values given for M' l5 M' 2 , cr' u and cr' 2 remain the same, if 
their results be interpreted in the sense of fecundity and not fertility. If p be the 
correlation between fecundity and fertility, and cr u cr 3 the standard deviations of 
these quantities, then \ = por 3 /(Ti ; but we have seen that it is also the ratio of mean 
fertility to mean fecundity. It follows accordingly that p is the ratio of the coefficient 
of variation in fecundity to the coefficient of variation in fertility. If we may judge 
by the cases of man and horse, so far as I know the only cases in which fertility and 
fecundity have yet been examined, a coefficient of variation in fecundity amounts to 
about 30 per cent., while one in fertility is something like 50 per cent. Thus the 
correlation of fertility with fecundity would be about '6. We should expect it to 
have a high value, perhaps even a higher value than this, In the case of thorough- 
bred horses, p will be the correlation between fecundity and apparent fertility. By 
direct investigation in the case of 1000 brood mares I find its value to be *5152. 

Passing now to the correlations r, r', r" s I observe that the proof given for fertility 
is valid with but few modifications, if these be fecundity correlations (see p. 266), for 
the proof involves no expansion of the factor (X^ + £) 2 . Hence we conclude that the 
regression coefficient for the inheritance of fecundity will not be modified by the 
nature of the record or the weighting of individuals with their fertility. 

When we come to the last series of constants, M"j, M" 2 , cr" u cr'^, we find that these 
will be modified, owing to the presence of the square factor {\x + £) 2 , although 
£ is not correlated with x. The term £ 2 now comes in, and S (£ 2 ) will give the 
standard-deviation of an array of fertilities corresponding to a given fecundity, i.e., 
S (£ 2 ) = 0-3(1 — p 2 ) X number in the array. # 

I find after some reductions that M" 2 and M" x are given by 

1% 1 2 



M" __ TV/T j ^1 / -1 I „ a I 1 . x \ { ' \ 

1 — JW-x -f- ^ ^tj" ail(p 2 Mf)J ••••■•• l xlx '/> 

the correlation of fertility and fecundity being now introduced into the results. 
Clearly the result (xvii.) 

M" — M' 

V77r~---~n7 = coefficient of regression ...... (xx.) 

M \ — M j ^ x ' 

still remains true. 

For the remaining two constants <y\ and <x" 1? I find, after some rather long 
analysis in the second case, which it seems unnecessary to reproduced 

* Should the regression not be linear, <7 3 \/(l — p 2 ) is the mean of the standard -deviations of the arrays. 
f In the course of the work the squared standard-deviation of a fertility array is assumed to be the 
same for all arrays ~ a\ (1 — ^r), and \ x is given its value /^ 3 /<x le See, however, the previous footnote, 
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//2- 2 

O" 2 = 0"2 1 



1 + r' 



o 



//2 



<J\ 



. I Jv.JSl.I« I, 



"2 
O" 1 



r^i I * - S ) + y 



'2 



1 _ -* -* 



Mf 



_£ °x . s {(« - M\m .. 



• , • 



M 2 + o- 2 
and y is' the factor "-_.;:- 9 -/ 9 > or as we can write It 

y = m\/(m\ + g ^ x 



If p be unity or near unity, i.e., fecundity very closely correlated with fertility, 
y = 1, the second term vanishes and (xxii.) becomes identical with the corresponding 
fertility formula (xvi.), just as (xxi.) is already identical with (xv.). 

Thus we see that the whole series of fecundity relations are strikingly like those 
for fertility, except that in certain of them — those for M 7/ 1? 'M" 2 , <r"i and cr" 2 — the 
correlation p of fertility and fecundity is introduced. If p be considerable, all the 
remarks we have made on the fertility formulae may, mutatis mutandis, be applied to 
the measurement of fecundity. 



(5.) Proposition IV. — To deduce formula; for finding the correlation between any 
grades of kindred from the means of arrays into which the kindred may be grouped. 

This problem is of very great practical importance. In the case of Man, families 
are so small tha,t there is comparatively small difficulty in forming all the possible 
pairs of brethren, say, for any family ; but when we come to animals or insects where 
the fertility may be extremely large, it is practically impossible to form a correlation 
table involving 50,000 to 100,000 entries.* One thoroughbred sire may have 50 to 80 
daughters, and thus give us roughly 1200 to 3200 pairs of sisters to be entered in 
a correlation table. Still higher results occur in the case of aunts and nieces. It 
may be asked why we do not content ourselves with one or two pairs from each 
parent ; the answer is simple : we have not (e.g., in the case of thoroughbred animals, 
pedigree moths, &c.) a great number of sires, and the sire with 50 offspring cannot, 
for accuracy of result, be put on the same footing as the sire with only 2 to 4. Our 
process is really an indirect weighting of our results. 

(A.) To find the coefficient of correlation between brethren from the means of 
the arrays, 

Let x be the measure of any character or organ in one brother (sister), and x that 
of a second brother (sister) : let m be the mean of one set of brothers, and m of the 

* Even with the reduction . in labour, introduced by this proposition and by the use of mechanical 
calculators, Mr. Leslie Bramley-Moore and I took practically a week, of eight-hour days, to deduce 
two coefficients of correlation, after the means of the arrays had already been found. 
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second set. Let n be the number of brothers in an array, and therefore \n (n — 1 ) the 
number of pairs of brothers in the array. Let cr and cr' be the standard deviations 
of the two sets of brothers, and r the coefficient of correlation between brothers for 
the organ in question. Let S denote a summation with regard to all pairs of brothers 
in the community, and 2J with regard to all brothers in an array. Let N be the 
total number of brothers in the community. Then if we selected our pairs of brothers 

* 

for tabulation at random (e.g., not by seniority or other character), we should find 
m = m and cr' = cr. Further, by definition of correlation 

Nro-o-' = S (a - m) (x' ~ m) = S3 (a - M + M - m) (x f - M' + M' - m), 

where M and M' are the means of the two sets of brothers in any array and are 
clearly equal. 

Further, 2 (x — M) = S (x — W) = 0, when summed for an array, and 
2 (x — M) (x — M') — 0, for there is no correlation within the array when the 
deviations are measured from the mean of the array. Hence : 

NroV = S {%n (n - 1) (M - m) (W - m')}, 
or 

Nro- 2 = S { \n (n - 1) M 2 } - 2mS {£w, (n - l) M} + m 2 N ; 



but 



Thus, finally, 



This can be written 



S {%n(n — 1)M} = Nm. 

_ 8 U n ( n - 1)M 2 }/K -m 8 , ... , 



/ „«, QH / (J" 9 & e # 9 ^ e 9 1 .A. -Jv JL V • J 



where cr a is the standard deviation of the arrays concentrated into their means and 
loaded with their sizes ; <x is the standard deviation of all brethren loaded with the 
number of times they are counted as brethren ; m is the mean of all the offspring 
loaded with the number of times they are counted as brethren. 

Let cr Q be the standard deviation of offspring, and p the correlation between parent 
and offspring, then the standard deviation of an array of offspring, if correlation be 

sensibly linear, # will be cr Q \/(l — p 2 ). We have, further, 

m = S (x) = S2 (a - M + M) = S {> (n - 1) M}, 

Na- 2 = S(x - mf = SZ(x ~-M + M ~~ mf ^ S {%(x ~™ M) 2 + £ra(n - 1)(M - m) 2 } 8 

But 

S(x- M) 2 = ^(n- 1) o- 2 (1 - p 2 ). 

* See, however, the first footnote p. 270. 
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Thus: 



or : 



No- 2 = Ncr* (1 - p 2 ) + Noi 



a 



and r may be written : 



oi (1 — - p 2 ) + crl (xxv.), 



a 






a 



*l (1 - p 2 ) + a* ' 



(xxvi.). 



Here cr a can be found from the arrays, and <x and p will in many cases have been 
previously ascertained. 

(B.) To find the correlation between "uncles" and "nephews" ("aunts" and 
" nieces") from the means of the corresponding arrays. 

Let n x be the number of uncles in an array, n 2 be the number of nephews in the 
associated array, so that n{a 2 is the number of pairs of uncles and nephews provided 
by the associated arrays. Let N = S (n x n 2 ) be the total number of pairs of uncles 
and nephews in the community under consideration. Let x be the measure of the 
organ or character in the uncle, x in the nephew. Let M and M' be the means of 
two associated arrays of uncles and nephews respectively. Let m and m be the 
means of all uncles weighted with their nephews and all nephews weighted with 

their uncles respectively, and let <x, a be the corresponding standard deviations 
under the same circumstances ; r the correlation of uncle and nephew. Then : 

NrW = S (x - m) (x - m') = SS (a? - M + M - m) (x - M' + M' - m). 

Now S (x — M) = £ (x — M') =. 0, and within the arrays there is no association 
of individual uncles with individual nephews, i.e., 2 (x — • M) (x — M') = 0. Thus : 



NrW = S {n Y n % (M — m) (M' — m')} = S (^MM') - Nmm', 



since 



Thus : 



m = S (n^ 2 M)/N, m' = S (%n 2 M')/N. 



S (w^aMMO/N - mm' 



crcr 



(xxvii.). 



If cr a and cr' a be the standard deviations of the means of the arrays of uncles and 
nephews and R the correlation of these means, the numerator is clearly Hcr a cr' a . 



Thus : 



r' = R %£f 

aor 



• • • . • 



(xxviii.). 



Here the numerator as a whole or in parts is easily found from the means of the 
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arrays. If cr and cr' be the means of unloaded uncles and nephews, we note that 
they are arrays owing to common parentage, and hence their array standard 
deviations* 5 '" will be o* \/l — p 2 and a s/\ — p\ p being the standard deviation of 
parent and offspring. As before we find : 

CT 2 = CTa + CTq (l -~ p ) "| 

> (xxviii.). 

cr' 2 =z cr'l + <?'l (1 — p 2 ) > . . , . . J 

If, as will probably be the case, there be no secular change between uncles and 

nephews, then cr = cr', cr a = cr a9 cr = cr' , and accordingly r = Rcr 2 /cr 2 ; whence, using 
(xxiv.), w r e have : 



9 9 

at & 



r 



I /\ AAi /\ ^ — 9 »••••••• I XaIXi I, 



<*a <? 



If we could assume <j a = <x tt and a = cr, this result would reduce to the very 

simple form : 

r f = r X R. 

Now the assumption cr = cr is, I think, legitimate, for the distribution for an 
unloaded array of nephews or uncles should be sensibly that of an array of brethren. 

But the equality of cr a and ar w which would now involve that of a and cr, is a much 

more doubtful point. <r a and cr a mark indeed quite different systems of loading. 
Both, it is true, are of the form 

S (nriW) / N - { S (nriM) / N } 2 , 

but in the case of brethren ri = \ (n — 1) or ?*■' has perfect correlation with n, while 
in the case of uncles and nephews ri is only imperfectly correlated with n. The 
intensity of this correlation depends upon the correlation between the sizes of arrays 
of uncles and nephews, a quantity which may be very small, or not, according to the 
nature of the record. Hence it appears necessary in applying the method to make 

some attempt to appreciate the value of cr a as well as cr a . If this be done R can be 
found from (xxix.), if not- directly. This value of R is not without importance for 
the inheritance of characters latent in one or other sex. 

We have thus reduced the correlations of individuals to a calculation of the corre- 
lation of arrays. 

(6.) Proposition V. — To find a measure of the effect of mingling uncorrected 
matericd with correlated material. 

The importance of this investigation lies in the fact that death, restraint, or other 

* Or, again, the means of the standard-deviations of the arrays. 
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circumstances, completely screen, in a certain number of cases, both the potential 
fertility and the real fecundity of man. Precisely similar circumstances, which will be 
considered more at length later, hinder our obtaining in horses a true measure of 
fecundity for all cases. We are thus really dealing with a mixture of correlated and 
apparently uncorrected material. In what manner does the influence of this 
mixture effect our results ? 

Let a group N consist of n x + n 2 + ^3 + n^ pairs of individuals. Of these, in the 
case of % pairs, both individuals have the true value of the character under investi- 
gation recorded ; in the case of n 2 pairs, neither have the true value recorded ; in the 
case of n 3 pairs, it is the first individual of the pair which has a true recorded value, 
and the second an apparent or fictitious value ; lastly, in n± cases, let the fictitious 
value be in the first and the real value in the second individual of the pair. Then 
there will be no correlation between individuals in the groups n 2 , n z , r& 4 . Let r be 
the correlation in the group % and E that observed in the whole group of 
N = % + w 2 + w 3 + n 4 . Let x be the measure of a character in the first, x in the 
second individual. Let M and M' be the means of the total groups of the two 
individuals and 2, % f their standard deviations. In group n x let the corresponding 
quantities be rn u m\, o^, a\, and a similar notation hold for the other sub-groups. 
Then m 1 = m 3 and cr l = cr 3 ; m 2 = <m 4 and cr 2 = cr 4 ; while m\ = m' 4 and cr\ = <t\ ; 
m 2 = m' 3j cr' 2 = cr' s . 

We have at once : 



M 



m L -f m 2 + n s + n^ n x + n 2 + n z -f % 



? 



while 



M' 



% + % + % + ^h 



Further 



(% + n 2 + n 3 + n 4 ) SS'R = S (sc — M) (a?' — M'), 



by the usual properties of product moments 

= w^oV + n x (m x — M) (m\ — M') + ti 2 (w 2 — M) (m 2 — M') 

+ n s (rri a - M) (m' 8 - M') + n 4 (m 4 — M) (m' 4 - M') 

= %< r i0"*V + n 1 m, 1 m\ + n 2 m 2 m 2 + n s m d m 3 + n±m$n \ 

— M (nxm'i + ^2^2 + Wgm'a + n 4 m' 4 ) — M' (n^ + ^2^2 + ^ 3 w 3 + n 4 m 4 ) 
+ MM' (n x + n 2 + ^3 + %) 

= n&ycr^r + n^m 7 ! + n 2 m 2 m 2 + n z m z w! z + rc 4 m 4 ra' 4 — MM' (% + n 2 + ^3 + ^ 4 ). 

Substituting the values of M and M' and using the relations between the m's, we 
find after some reductions : 

2 N 2 
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N22'R = n^oV + 



n„n A — n,n, 



^"i 



l /c, 2 



% + n 2 -f % -f % 



(m 2 — m 2 ) (m 7 ! — m 2 ) , . (xxx.) 



1 ... 1 

Let — of the N first individuals and — ~ of the N second individuals have fictitious 
P ( 1 

?J — ■ 1 Q — 1 

values, then — — N and -=—— N will have their true values. If, now, there is no 

P 9. 

correlation between the fictitious values in the two cases, we have at once : 



Wi 



= (g - i) (g - i) 



N, 



n 2 = — N, 



n =* — *N, 



tf — 1 , T 

P2 



From this it follows at once that 



n s Ui 



iun 



l"2f 



or the second term in (xxx.) vanishes. Thus 



R = 



% CTjCTir 



N 22' 



9 # * * 



. (xxx.) bis. 



Thus R vanishes with r, and no spurious correlation could arise from the existence 
of fictitious values distributed at random through the correlation table. This result 
might, indeed, (as it often is tacitly) be assumed by some, but it seems very desirable to 
have a definite proof. 

It remains to consider 2 and 2'. We have : 



or 



N2 2 = n x cr\ + n % a-'\ + n z a\ + n&i 

+ ??! (m 2 — M) 2 + 7i 2 (m 2 — M) 2 + w 8 (m 3 — M) 2 + n 4 (m 4 ■ 

= (wi + n 8 ) o- 2 + (w 2 + w 4 ) o| 

+ («i + w 8 ) m? + (^2 + n i) mi — (% + ^2 + ?h + ^4) M 2 ? 



*?! + n, t 2 Wg + % o , % + % % + % / v 



«*«>. 



N 



= 1 



N 
1 \ 1 

) °"l + ~~ 0*2 ~H 

PI P 



N 



1 



= ^ + -r( 



(T, 



P 



o?) + (l 



1 



P / P 
1 



(w*! — m 4 f 



P 



(mi — m 4 ) ; 



o » 



Similarly : 



"2 



crl + y (cr' 



o-l) + 1 



I 
2 



2 



M) 1 



(xxxi.). 



(m'i — m' 2 ) 2 . • (xxxii.). 



Now if the introduction of the fictitious values consisted of anything of the 
nature of a wrong pairing of certain individuals, we should simply have <Xx = cr 2 , 
cr 7 !— a-' 2 , m x = m 2 , m^ = 7??/ 2 and, accordingly, 2 = crj, and 2 7 = or\. 
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In any case, if the percentage of fictitious values be not large, the second and third 

1 1 

terms are of the second order of small quantities, since — and — are small. The 

P 2 

maximum value of the third term cannot be greater than ^ (m l — - m 2 )'\ and this 

will be relatively small in the cases to which we shall apply it. 

For example, no great changes are made in a-, when we vary the amount of 

fictitious cases introduced into our fertility tables. m x and m 2 do, however, change. 

Thus o"! = <x 2 = 3 approximately, and the range m Y -— m 2 = 1*2. Hence : 

2 2 — 9 + £ (1 *2) 2 , at a maximum, = 9*36, 

or, 

2 — 3-06. 

Thus in this extreme case there is only 2 per cent, change in the value of 2. In 
such cases accordingly we may take for rough approximations 2 = a and 2 = or'. This 
leads us to : 

N x ' 

Or, the reduction of correlation, due to the introduction of fictitious values, is 
obtained by using as a factor the ratio of actual correlated pairs of individuals to the 
total number of pairs tabulated. 

This result will be of considerable service when we come to deal with the fecundity 
of thoroughbred racehorses. 

(7.) 'Proposition VI.— To obtain a measure of the spurious correlation apparently 
existing between two organs, when a mixture is made of heterogeneous materials. 

Let x and x f be measures of the two organs, and let there be N pairs of organs 
formed by i heterogeneous groups containing n u n 2 , n 3 . . . pairs with means m u m' l9 
m 2 , m' 2 , m 3 , m' 3 . . . , &c, standard deviations cr 1? <r' u cr 2 , cr^, or 3 , cr' 3 . . . , &c, and 
correlations r l9 r 2 , r 3 . . . , &c. Let M L M' be the means of the whole heterogeneous 
community, 2, 2' the standard deviations, and R the correlation. Then : 

R22'N = S (wcrcrV) + S {n (m - M) (m' - M')}, 

where S denotes a summation with regard to all i groups. Now if there were no 
correlation at all between the organs in any one of the i groups, R for the hetero- 
geneous mixture would still not be zero so long as the second summation did not 
vanish. This, then, is a measure of the spurious correlation produced by making 
a mixture of uncorrected materials. 

Now S {n(m — M) (ni ~~ M')j, remembering the values of M and M' may be 
written : 
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S 






(m p — m 2 ) (m'p — m' q ) \ . . . . . . (xxxiv.) 



where the summation S now refers to every possible pair p and g of the r groups. 

Now it is very unlikely, unless i be very large and the numbers n u n 2 , n s ... be 
taken at random, that this expression will vanish. Suppose even that the means 
of our heterogeneous groups were uncorrelated, i.e., S (m — M) (m' — M') = 0, it is 
unlikely that S{n (m — M) (m — M')} will also be zero, when n is taken at random. 
With a comparatively few groups, with numbers taken at random, it is extremely 
improbable that the principal axes of the i points loaded with n u n 2 , n z . , . will 
exactly coincide with the directions of the axes of x and x\ 

We are thus forced to the conclusion that a mixture of heterogeneous groups, 
each of which exhibits in itself no organic correlation, will exhibit a greater or less 
amount of correlation. This correlation may properly be called spurious, yet as it 
is almost impossible to guarantee the absolute homogeneity of any community, our 
results for correlation are always liable to an error, the amount of which cannot be 
foretold. To those who persist in looking upon all correlation as cause and effect, 
the fact that correlation can be produced between two quite uncorrelated characters 
A and B by taking an artificial mixture of two closely allied races, must come rather 
as a shock. # 

The better to illustrate this, I take some data recently deduced by Miss C. D. 
Fawcett. She finds for 806 male skulls, from the Paris Catacombs, the correlation 
for length and breadth '0869 db "0236, and for 340 female skulls, from the same locality, 
— '0424 ± '0365. The existence of the negative sign and the comparative smallness 
of the correlation, as compared with the probable errors, might lead us to assert the 
correlation between the length and breadth of French skulls to be sensibly zero. 

If now the two sexes be mixed, the heterogeneous group has for correlation 
"1968 ± '0192, a value which cannot possibly be considered zero. Thus the mixture 
exhibits a large spurious correlation. 

Whether any given mixture increases or reduces the correlation will depend 
entirely on the signs of the differences of the means of the sub-groups. But the 
clanger of heterogeneity for the problem of correlation will have been made manifest. 
If the value of R for any mixture, whose components are known, is to be calculated, 
then we have only to note that : 

<2 _ S ( n ^) , S (n p n Q (m p - m q f) ^, 2 _ S (no*) S (n p n q (m' p - m'J) / x 

"""" N" "N" 2 ? "~~" N* 1ST 2 ^aaav.j, 



* Thus the mere fact of breeding from two or three individuals selected at random can easily produce 
a correlation between organs in the offspring, which has no existence in the species at large. 
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II. On the Inheritance of Fertility in Mankind. By Karl Pearson, F.R.S., and 

Alice Lee, B.A., B.Sc. 

(8.) In commencing an investigation of this kind where the results to be expected 
were quite unknown to us, but where we had reason to believe that the apparent 
strength of inheritance must be very small, we considered that the first thing to be 
done was to investigate the largest possible amount of material. Thus the probable 
errors of our results would be very small and any, however small, correlation between 
fertility in parent and offspring would be brought to light. Attempts might then be 
made to strengthen any correlation discovered by removing so far as possible one 
after another the various factors tending to screen the full effect of the inheritance 
of fertility. 

Such factors are for example : 

(a.) The age of both husband and wife at the time of marriage. The real fertility 
may be screened by late marriages of one or both parents. The relation of fertility 
to age at marriage has been dealt with by several writers, notably by Duncan and 
Ansell.* 

(6.) The duration of marriage. The data may be taken from a marriage not yet 
complete, both parents being still alive. Or from a marriage which is complete one 
or both parents being dead. In the former or the latter case the marriage may be 
complete so far as fertility is concerned, i.e., details of offspring may be available till 
the wife has reached the age of 50 years, which for statistical purposes may be taken 
as an upper limit to fecundity. 

(c.) Restriction of fertility during marriage. It has been shown in a paper on 
Reproductive Selection!" that there is evidence of the sensible influence of this factor 
in man. It tends to give fictitious values to the fertility of the younger, rather than 
the elder generation 3 and so obscures the correlation. 

We have accordingly two problems before us : 

(i.) Supposing these and other factors tending to screen the effects of reproductive 
selection to exist, can we show that it still produces sensible effects in the case of 
man, and thus demonstrate that fertility is really inherited ? 

(ii.) Can we by eliminating these factors so far as possible obtain a lower limit to 
the coefficient of heredity in the case of fertility, and ascertain whether it 
approximates in value to what we might expect from the Law of Ancestral 
Heredity ? 

The first impression of the reader may be that it is only needful to select the 

* J. Mathews Duncan, 'Fecundity, Fertility, Sterility and Allied Topics,' second edition, 
Edinburgh, 1871. Charles Ansell, Junr., ' Statistics of Families in the Upper and Middle Classes/ 
London, 1874. 

f ' The Chances of Death and other Studies in Evolution,' vol. I, pp. 77, 89, 
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fertility of marriages, which were formed with husband and wife between 20 and 
28* say, and which have lasted till the wife is over 50. But these conditions must 
be true in tivo successive generations, and, had we adopted them, we may safely say 
that without immense labour it would have been impossible to collect even a 
thousand cases. From the whole of the peerage, the baronetage, the landed gentry, 
a variety of family histories, of private pedigrees, and a collection of data formed of 
families at first hand, it was not possible to extract more than about 4000 cases for 
the inheritance of fertility in the female line, when the limitations were far less 
stringent, being applied only to one generation, and consisting in our taking marriages 
entered into at any time of life for either husband or wife, and lasting till the death 
of one member or for at least fifteen years. Even in this case the pedigree of the 
wife had to be sought for from one record to another and often in vain. It is the 
male pedigree with which the recorder in nearly all cases occupies himself. 

Only those who have attempted the labour of extracting, as has been done in this 
case, some 16,000 separate returns, will fully grasp the difficulty of making the 
limitations of selection more and more complex ; the quantity to be obtained becomes 
dangerously small and the labour immensely increases. Even could with time and 
patience a sufficient selection of ideal cases have been made, it does not follow that 
the result would be satisfactory ; for, we should have made a narrow selection, and 
this very fact might indicate that possibly we have been selecting one grade or class 
of fertility. It is possible that the less fertile are the weaker, and so more liable 
to die early ; or again it may be the more fertile women who are subjected to the 
more frequent risk of childbed, and thus are less likely to appear in the selection of 
long marriages. Even greater or less risk at birth may be an inherited character 
in women, and may not unfairly be looked upon in itself as a factor limiting fertility 
naturally. 

Taking these points into consideration, it seemed that if we were to have enough 
material to draw conclusions from we must entirely drop all attempt to classify by 
age of parents at marriage. We might make some limitations but they must not be 
very stringent ; they must leave room for an increase of stringency in different 
directions, so that we could roughly appreciate the influence of the screening factors. 
Accordingly our plan has been to show that correlation actually does exist between 
parent and offspring with regard to fertility, and that when we make the conditions 
more stringent the correlation increases towards the value indicated by the law of 
ancestral heredity. 

(9.) On the Inheritance of Fertility in Woman.— (i.) Table I. gives the result for 
4418 cases of the fertility of a mother and of her daughter. These were extracted 
from Foster's 4 Peerage and Baronetage/ Burke's e Landed Gentry/ some family 

* As Duncan points out, an early marriage on the average means an earlier cessation of fecundity; a 
somewhat later one does not necessarily connote less fertility. 
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histories atid a collection of family data drawn from private pedigrees and other 
sources. In the case of the daughter, no marriage was taken which had not lasted 
at least 15 years, or until the death of husband or wife. In the case of the mother 
no limitation whatever was made, the number of brothers and sisters of the daughter, 
including herself, being counted. Weight was given to the fertility of the mother, 
for every possible case that could be got from the records under the above conditions 
was extracted. It is quite possible that a certain proportion of offspring dying in 
early infancy have not been entered in. the records. 

If M m , Mtf be the mean fertilities of mother and daughter, cr m , <r d their standard 
deviations, and r md their correlation, we found : 

M d = 3:494, M*— .6-225, 

' <r d = 2*975, ' cr w == 3*052, 

r ml = -0418. 

Clearly owing to the near equality of cr d and <r m the regression of daughter's on 
mother s fertility is sensibly equal to the correlation. 

The probable error of r md is determined by the formula given by Pearson and 
Filon* to be -0101, or r md is four times its probable error. 

We thus conclude : 

(i.) That fertility is inherited in the female line. 

(iu) That its effects are very largely screened by the factors to which we have 
previously referred. 

Had we started with no limitation as to the daughter's family, it is highly probable 
that r rad would scarcely have been sensible relatively to its probable error, and, 
therefore, small series without due regard to screening causes may easily lead the 
recorder to suppose that fertility is not inherited. 

Supposing we exclude from the daughters the 775 barren marriages, we find the 
mean for 3643 cases of fertile marriages to be 5*237. Comparing this fertility with 
the observed fertility 6*225 of mothers, a superficial inquirer might at once consider 
that a diminution of fertility has taken place. The fact is that neither of the results, 
M m or M. d gives the actual fertility of the mothers or daughters. These are the 
means W\ and M" 2 of formulae (viii.) and (vii.) of the theoretical investigation. 

Let us apply the theory developed to our statistics. In the first place we note 
that r is small ; hence r 2 is still smaller, and thus by (xv.) <x" 2 will not differ much 
from cr 2 . Since cr f \ will be generally less than o^ by (xvi.), it follows that <x" 2 will 
probably be less than cr 2 . Approximately, we can take <r 2 = 3. Turning to (vii.) we 
see that M" 2 cannot, since r is small, differ widely from M 2 . If there be no secular 

# "Contributions to Theory of Evolution.-— IV." 'Phil. Trans,,' A, vol. 191, p. 242, 
VOL. CXCII. — A, 2 O 
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evolution in the real fertility sensible in the one generation, then Mj would equal 
ML Hence to a first approximation we should have ; 

M l -M 2 = M , ' 2 = 3-494. 

To obtain a second approximation we may substitute this in the small terms of 
(vil). Here <j'\ must be found from (ii.) ; neglecting the cubic term we have ; 

crl/cr? = 1 - o?/Mf = °2628. 

TT 

Mj = M 2 = 8*494 — r~( 1 + 7-^7^) 

= 3*494 — 9 0418 X 2*5759 X 11513 
3-494 - -124 = 8-370. 



We can now substitute this value of Mr in (viii.), and we find ; 

M"x = 8-370 + 2-980 = 6*850, 

This differs comparatively little from the actually observed value, 6*225, and is 
satisfactory evidence of the validity of our theory. The fact that the elder generation 
was in no way limited like the younger, and that we have neglected the third 
moment — although fertility distributions are never normal— as well as made other 
approximations, is quite sufficient to account for the difference observed. 

We may take it that 3*4 is practically the fertility of the elder generation, and 
that this is raised to about 3*5 by reproductive selection in the younger generation. 
The result 6-2 for the elder generation is thus purely a result of weighting due to the 
nature of the record, 

(ii.) Table II. gives the result of 1000 cases taken from the Peerage, Here the 
conditions of extraction were as follows :~— 

One member only was taken out of each family, or no weight was given to the 

fertility of mothers. 

The daughters' marriages had all been completed by the death of one parent or had 

lasted at least 15 years. 

There was no limitation with regard to the parents' marriages. We found ; 

M* = 3-923, M m = 5*856, 

o- d = 2*758, cr 1B = 2'751, 

The coefficient of regression is sensibly equal to that of correlation. The probable 
error of r dm = '0204, or not a tenth of the value of r dm itself. Again we conclude 
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that fertility is certainly inherited in the female line. By selecting fairly homogeneous 
material with a more definite and complete record than exists for the heterogeneous 
material of the previous case, we have carried up the correlation to five times its 
previous value, and within a reasonable distance of the value *3 which would be 
required by the law of ancestral heredity. The homogeneity of our material is 
evidenced by the reduction in both standard deviations ; the greater completeness of 
the record by the rise in the fertility of daughters ; and the non-weighting of the 
fertility of mothers by the fall in their mean fertility. 

If the reader will turn back to the theory of the influence of heterogeneity on 
correlation in section (7), he will notice that the expression in (xxxiv.) will be negative, 
and therefore the apparent correlation less than the real, if we form a mixture of two 
groups in which m v > rn q and m p < m r Now the entries of women in the Landed 
Gentry and other records are very often entries of " heiresses," while the entries of 
women in the Peerage are entries because of class. An " heiress" naturally has fewer 
brothers and sisters than another woman on an average, or we may expect m q > m r 
On the other hand an " heiress " need not have fewer children than other women, 
unless her heritage is the result of her coming from an infertile stock, and is not a 
result of the incompleteness of her parents' marriage. If she belongs to a somewhat 
lower social grade, she may possibly be more fertile than the average of a higher 
social grade. In this case m. p will be > m q9 and when we come to mix records of the 
Peerage with those of the Landed Gentry and Family Histories, we need not be 
surprised to find the correlation of fertility much weakened, as it undoubtedly is (as 
shown by (i.) and (ii.) above) by the mixture. 

Let us next apply our theory to the above results. We are now dealing with 
M' l5 M' 2 , cr' i, c/o- Assuming that there is no secular change 07 = <x 2 < and accordingly 
since cr\ ■•=■ o*^ sensibly, formula (xi.) shows us that both = <r\. 

Further, if Mi == M 2 , formula (v.) is a quadratic equation to find M L ; substituting 
for M' 1} cr'i, and r, we have, on solving and taking the only admissible root, M^ =2 3*4625. 
Then, applying formula (i.) to find M^, we have : 

M\ ='5-660. 

This is not quite as high as the observed value 5*856, but it suffices to show that 
our theory expresses the main facts. In all probability we have not entirely freed 
our results from weighting with fertility ; because, although every endeavour was 
made to take only one from each family, it is possible that pairs of sisters have occa- 
sionally crept into the record. 

(iii.) Table III. gives the result of 1000 cases taken from the Landed Gentry. As 
we have already noted, the women recorded are largely "heiresses," and we believed 
this might be one of the chief sources of the heterogeneity of the material in Table I. 
The conditions of selection were made somewhat nlore stringent, and were as follows : — 
Only one daughter was taken from each family, and her marriage must have lasted at 

2 o 2 
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least .15 years. No limitation was placed on the duration of the parents' marriage. 

We found : 

M d = 4-282, M wl = 5 9 403 5 

(T c i '■^--' O Zi %} Zj - @°m ~~~~ O Z4fc i. j 

The probable error of r md = '0211, and again we see that fertility is certainly 
inherited. The correlation has, however, sunk ; probably, as the great increase of 
variation indicates, because we are dealing with much more heterogeneous material than 
in the case of the Peerage. While the selection of " heiresses ?? has largely reduced 
the number of brothers and sisters, i,e>, the fertility of mothers, the limitation to 
marriages of at least 15 years has increased the apparent fertility of daughters ; nor 
is this increase at all balanced by the fact that heiresses come from small families, and 
may, therefore^ be supposed to be the children of rather sterile mothers. The average 
number of children of heiresses is sensibly as large as the average number of children 
of women who are not in the bulk heiresses, and who have, as in the following case, 
been selected with the same condition as to duration of marriage. The fact is that 
heiresses are not on the whole the children of sterile mothers ; their high fertility and 
their small correlation with their mothers shows us that heiresses in the bulk are 
rather the daughters of mothers whose apparent fertility is fictitious. They have, 
owing to the sterility or early death of their husband, to their own marriage late in 
life, or to some physical disability, or other restraint, never reached their true fertility. 
If this conclusion be correct, and a comparison of the values of M d and r md in this and 
the following cases thrusts It almost irresistibly upon us, then we see that the 
argument against the inheritance of fertility based upon the fertility of heiresses and 
non-heiresses is of no validity. # It could not be valid as against the values of the 
correlation we have found, but the present investigation shows by the value of r md 
exactly wherein the error lies : the heiress is not infertile, but is the daughter of a 
fictitiously infertile mother. 

Applying our theory to this case, we find from formula (xi.), putting crj ■-== c 2 : 

a- j = (a'l .— rVi)/(l — r% 

whence we.find a Y = 3*293, a result sensibly identical with or. d . Solving the quadratic 
(v.) with M\ = M 2 to find Mi we find .:.. ■--. •- 

.LYJL [ ^— • o o Zi i 

Hence by (i.) we have: - .-.-.-■ 

M 7 != 6'838, . 

the actually observed value being 5*403. Thus the theory completely falls to give 
the fertility of the heiresses' mothers ; for such a fertility as we find in the daughters, 

* Sec, for example, a recent letter of Mr. Howard Collins in ' Nature,? November 3, 1898. 
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the mothers' fertility is far too low. This again emphasises the point we have 
already referred to. The peculiar character of the selection, which leads to the 
female record in the Landed Gentry, is not one such as we have considered in our 
theory, where the record of an^y family is likely to appear in proportion to its size. 
Such a distribution is a chance distribution, but a selection of women inheriting land 
has not this character, and a woman who is the mother of co-heiresses is hardly 
doubly as likely to appear as the woman who is mother of one. A marriage in either 
case is likely to be arranged, and if we take only one daughter from each family the 
record will not already have weighted — at any rate to the full extent — every mother 
with her fertility. If the reader will compare the variation columns for both daughters 
and mothers in Table II L with the corresponding columns in Table II. or Table IV., 
he will at once see how anomalous is the selection of women given in the Landed 
Gentry. 

(iv.) Table IV. gives the results for 1000 cases taken from the Peerage and 
Baronetage under the following limitations ; one daughter only was taken for each 
mother, and in the case of both mother and daughter the marriage must have lasted 
at least 1 5 years. We found : 



** = 


= 4-335, 


M w = 


5-898, 


<Td ~ 


= 2-967, 


i 


2-830, 




t*mtl " 


= '2130. 





The probable error of r md = *0204. Thus, as it is now hardly necessary to repeat, 
fertility is certainly and markedly inherited. The regression coefficient is now as 
high as *2233, the closest limit we have yet reached to the theoretical '3 of the law 
of ancestral heredity. 

Owing to the limitation to marriages of 15 or more years, the means of the 
fertilities of both mothers and daughters have risen, in the latter case more, how- 
ever, than the former. It might have been expected that the fertility of mothers 
would have risen more, but it must be remembered that M m is the apparent and not 
the real fertility of mothers ; and further, since the record largely weights the more 
fertile women, the bulk of the mothers are already those with large families, i.c<> 
those whose marriages have lasted at least 15 years. 

Assuming that there is no sensible secular change in unweighted fertility. Le* } 
ar l = o- 2} we have from the formula on p. 284 : 

o-! '== 2'973. 

From (v«) with M t = M 3 we find : 

M t = 8'845 

for the real fertility of mothers. This is a sensible increase on the value 3*463 given 
in Case (ii.), in which there was no minimum duration to the length of the mothers 
marriage. 



286 PEOFESSOR EARL PEARSON AND MISS ALICE LfciE, 

Applying formula (i.) we find : 

M / l = 6-144, 

which is somewhat more than the observed value 5*898. The reason for this lies, 
we think, in the difficulty already referred to on p. 263. If we start extracting 
mothers, it is often difficult to follow the daughter's history ; starting with the 
daughter it is much easier, although still laborious, to trace back her ancestry, and 
find the number of her brothers and sisters. Even in this case the search may be 
lengthy. But as daughters when married change their name, it requires great care 
in extracting large quantities to be sure that a mother is not repeated, i.e., some 
approach made to weighting her with her fertility. Every care was made in 
extracting the records, but we cannot hope to have always avoided weighting to 
some extent a mother, and if this be done we shall have a transition from formula) 
(xi,), (v.), and (i.) towards formuke (xv.), (vii.), and (viil), which would well account 
for the difference found between theory and observation. 

If we sum up for inheritance of fertility in the female line on the basis of these 
four cases, we draw from each one of them the unquestionable result that fertility in 
woman is an inherited character. Further, the more we remove causes of fictitious 
values for the fertility in either generation, the closer does the value approach that 
required by the law of ancestral heredity. The two chief disturbing factors which 
we have not been able to eliminate are (a.) the age at which marriage is entered 
upon, (6.) restraint giving a fictitious value to the fertility. Both these causes must 
give a lessened value to the correlation of fertility between mother and daughter, 
and the first, judging from the great influence of age at marriage on fertility, cannot 
fail to give a serious diminution. Hence if we find the regression coefficient as high 
as '2233, when we neglect these factors, it is no stretching of facts to conclude that 
it would in all probability rise to *3 could we take them into account. 

Our conclusion, therefore, is that fertility in woman is certainly inherited through 
the female line, and most probably according to the law of ancestral heredity. 
Reproductive selection is actually a vera causa of progressive change, but its 
influence is largely, if not entirely screened by the numerous factors tending to make 
the apparent fertility of women differ from their real or potential fertility, 

(10.) On the Inheritance of Fertility in Man, 

(L) While many of the difficulties involved in the extraction of data for women 
still exist for man, a new and important feature tending to screen the full influence 
of the law of ancestral heredity arises in his case* The full fertility of the husband 
is not in the average case at all approached in' the case of moriogamic marriage. 
Hence i in considering the size of a mail's family as a measure of his fertility we 
are measuring a character which differs largely from the character of * fertility in 
woman. - It is only in the case of sterile or even very sterile men that there is 
likely to be a correlation shown between the sizes of the families of fathers and sons. 
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The intensity or duration of fecundity in the husband must, one or other, be less 
than that of the wife, — and this will hardly be so in the great run of cases — if his 
family is to be in any way a measure of his fertility, or, as it might be better 
to call it in this case, his sterility. We are seeking to find a correlation between 
two characters, one in father and one in the son. neither of which we can measure 
unless they fall short of a certain limit. The result is that our correlated material is 
weakened down by the admixture of a mass of uncorrelated material in the manner 
indicated in Proposition V. of the theoretical part of this investigation. Within the 
family we cannot hope to get a correlation which will approach that indicated by the 
law of ancestral heredity, We may still, howevei% hope to ascertain whether 
fertility, respectively sterility, is an inherited character in man as well as woman. 

(ii.) Our first attempt was to collect as much material as possible, so that our 
limitations were few, The Peerage, Baronetage, Landed Gentry, Family Histories, 
private pedigrees, and collected data provided the 6,070 cases arranged in Table V. 
Here large families were weighted because several, where available, were taken from 
one family. The son's marriage must either have lasted till the death of one partner 
or at least 15 years; there was no condition as to the duration of the father's 
marriage. 

We have spoken of the correlation between fertility of father and son, but since 
only a single marriage of the father is taken, it may be equally well termed a 
correlation between the fertility of the mother and son, which may, perhaps, to some 
extent explain the relatively high values reached. 

Let M s , cr s be the mean and standard-deviation of the son, M p , cr p of the parent, 
and r sp the correlation ; then we found : 



M s = 3-871, 


M,= 


= 5-831, 


cr s = 3-008, 


<T V ~ 


: 3-190, 


' sp ~~" 


•0514. 





The probable error of r yp = '0087. Thus the correlation is nearly six times the 
probable error, or fertility in man is certainly inherited. 

(iii.) Table VI. contains the result of extracting 1,000 cases from the Peerage, only 
one son being taken from each family, and his marriage having lasted at least 15 
years. No attention was paid to the length of parents' marriage. 

We found : 

M, = 5-070, M p .= 5*827, 

cr, = 2-910, cr p = 3-142, 

r rp = -0656. 

The probable error of r sp = '0212. This case closely confirms the previous case; 
M. ? and or p remain sensibly the same, M, has risen owing to the longer period of 
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duration of the son's marriage, and since there is a longer period for the possible 
exhaustion of the male fertility, we find r, p is slightly larger. Although the numbers 
are smaller than in Case (i.), the probable error is not so large but that we can still 
assert an inheritance of fertility in man. 

(iv.) Lastly, to compare with Case (iii.) for women, 1000 cases were extracted 
from the Landed Gentry, and are given in Table VIL Here no marriage of the son 
or parents was taken under a minimum of 15 years duration, and only one son 
taken from each family, We found : 

■ M,= 5-304, M. P = 6-272, 
cr, = 2-951, or,, == 2-911, 

r, ? = '1161. 

Thus the longer duration of the marriage, which gives a greater chance for the 
exhaustion of the fertility of a partially sterile father, leads to an increased corre- 
lation. The probable error here is '0210, and the correlation is thus unquestionable. 

It would be idle to apply the theory before developed to these male cases, for the 
simple reason that we must certainly look upon them as containing a large proportion 
of uncorrected material. But they suffice to show that male fertility is an inherited 
character, and although the results are widely different from those indicated by the 
law of ancestral heredity, they are large when we consider how little male fertility 
appears measurable by the results of monogamic marriage. Were an approximately 
close measure of male fertility available, there is certainly in the above results no 
reason to induce us to believe that it would not be found to obey the law of ancestral 
heredity. 

(11.) On the Inheritance of Fertility in Woman through the Male Line, 

Although we are not able to measure the potential fertility of the male, we are 
able to determine whether he transfers fertility from his mother to his daughter. 
This may be simply done by correlating the fertility of a woman and that of her 
paternal grandmother. This problem belongs to an important class— namely, 
questions as to the extent to which a sexual character is inherited through the 
opposite sex, Darwin has touched upon this " transmission without development " in 
Chapter viii. of the 4 Descent of Man/ # and we shall find his views amply verified. 

The problem before us is : Does a woman have as close correlation with her paternal 
as with her maternal grandmother in the matter of fertility ? 

To solve this problem 1000 cases were taken out of the Peerage for the fertility of 
a woman and of her paternal grandmother. The marriages of the woman and of her 
grandmother were both taken with a minimum duration of fifteen years. Everv care 
was taken that no weight should be given to fertile families by taking only one out 

* Second Edition, p. 227, l Laws of Inheritance.' 
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of each famity, but, of course, the difficulty of avoiding this is increased when a 

pedigree must be traced through three instead of two generations. 

If d denotes granddaughter, g grandmother, the following results were obtained 

(Table VIII.) :— 

M d = 4*411, M, = 5*657, 

cr d = 2*897, <r g = 3'056, 

do ' — 1 l«0» 

The coefficient of regression of daughters fertility on grandmother's fertility = '1065. 

The probable error of r dg = *0211. 

According to the law of ancestral heredity 46, we should expect the grandparent al 
correlation and regression to be half the parental and equal to '15. Comparing the 
present result with Case (Iv.). we see that '1123 and # 1065 have to be compared with 
-£(•2130) and |>(*2233) 5 or with *1065 and "1116. These are differences well within 
the probable error of our results, or we may conclude that the correlation of a woman 
with her paternal grandmother is exactly what from Case (iv.) of Section (9) we 
should expect to find for her correlation with her maternal grandmother. The reduc- 
tion from *15 to '1123 is just what we might have predicted after the maternal 
reduction from *3 to *2130. We, therefore, conclude that the fertility of woman is 
inherited through the male line with the same intensity as through the female, and 
this intensity is most probably that which would be indicated by the law of ancestral 
heredity. 

(12.) We do not stay to consider many points which flow from our tables, such, for 
example, as the amount of restraint indicated by the hump at the start of our various 
frequency distributions for size of families, partly because such consideration would 
lead us beyond our present scope, the inheritance of fertility, and partly because this 
point has been already dealt with by one of us in a paper on ' ^Reproductive Selection.' 
We consider that we have shown fertility in mankind to be an inherited character in 
both lines, and probably obeying the law of ancestral heredity .t By aid of our 
theoretical investigations it is clear that the average size of a family (M x ), as deduced 
from our record data (M^ or M^). is about 3 '5 children, if the marriage lasts till the 
death of one partner, or at least till 15 years; it is about 3*9 to 4 children if the 
duration of the marriage is at least 15 years. Reproductive selection would increase 
this average by about *5 child per generation were its influence not counteracted 

* "Mathematical Contributions to the Theory of Heredity, on the Law of Ancestral Heredity," 
£ Roy. Soc. Proc.,' vol. 62, p. 397. 

t In the paper on " The Law of Ancestral Heredity " (' Roy. Soc. Proc.,' vol. 62, p. 412) it is stated 
that fertility is probably inherited, but the amount falls below that which would be indicated by the 
law of ancestral heredity. At that time only Case (i.) of Section (9) and Case (i.) of Section (10) had 
been worked out in detail. It is the rise of correlation with more stringent limitation of opposing 
influences, which suggests that after all that law is true for fertility as for other characters. 
VOL. CXCTT. — A. 2 V 
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by a variety of other factors of evolution. These factors are so active that the 
influence is reduced to '12 of a child per generation if we take Case (i.) of Section (9), 
and, we have little doubt, would be practically insensible did we take all marriages 
without any limitation whatever. Reproductive selection must, therefore^ be looked 
upon as always tending to increase the fertility of a race ; races are not only ever 
tending to increase, but tending to increase the rate at which they increase—a feature 
not recognised by Malthits, but which strengthens certain of his arguments. So 
soon, therefore, as environment, or other circumstance, relieves the pressure of 
opposing factors, a race will not only increase in numbers, but also in fertility. It is 
this inherited character of fertility, and its constant tendency to change unless held 
in check by natural selection or other factor of evolution, which seems to us the 
source of the immense diversity in fertility to be observed not only in different species, 
but in local races of the same species. 

III. On the Inheritance of Fecundity in Thoroughbred Racehorses. By Karl 
Pearson, F.E.S., with the assistance of Leslie Bramley-Moore. # 

(13.) The data provided for the fertility of thoroughbred racehorses by the stud- 
hooks, are of a kind which cannot be hoped for except in the cases of pedigree animals 
kept for breeding purposes, and of specially-arranged experiments on insects, &c. 
We have a practically complete record of the stud-life of every brood-mare. The sire 
by whom she has been, covered in each year is stated, and the result, barren, dead 
foal, living filly or colt, twins, &c, can be ascertained. It is also possible to find out 
whether the foal dies young, say as a yearling. By examining the whole series of 
stud-books the complete pedigree of any mare or sire can nearly always be found, 
and the correlation theoretically worked out for almost any degree of relationship. 

In starting an investigation of this kind on such a great mass of raw material, 
it is necessary to draw up certain rules for the extraction and arrangement of data. 
These rules must be prepared without any definite knowledge of the character of the 
material in bulk, for this can only be found after, perhaps, some 1000 cases have 
been extracted and worked out. Hence the rules originally adopted are often not 
such as an investigator would have arranged had he known beforehand the general 
character of the conclusions he would reach. But the statistician cannot, like the 
experimental physicist, modify without immense labour his methods and repeat his 
experiment. The collection of his data has frequently been far too laborious a task 
for repetition. His raw material has been prepared in a certain manner ; he may 

* During the three years in which this investigation has been in progress, a considerable number of 
friends have given me substantial aid in the arithmetical work, or in the preparation of the 6,000 
pedigree cards on which the results are based. Mr. Bramley-Moore has latterly been my chief helper, 
but I am also much indebted to Miss Alice Lee and Mr. G. U. Yule. Miss Margaret Shaen and Miss 
Ltna Eokenstein have also contributed to the labour of extracting the raw data from the stud-books. 
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sort and rearrange his data cards in a variety of ways, but to prepare new cards on 
a different system is practically beyond his powers. 

These remarks are made in order to meet criticism of the method in which my 
data cards were prepared. I could now possibly extract more convenient data, but 
that is only because of the knowledge gained in the process of examining the 
fecundity of several thousand horses. I did not even know, ab initio, the extent of 
variability in equine fertility ; I did not even know the immense preponderance 
which would have to be given to certain sires, at any rate I had no numerical 
estimate of it. Nor had I any percentage of the number of cases in which a 
pedigree might end abruptly with an alternative sire. # 

I saw at once that the apparent fertility of racehorses was even less close to their 
potential fertility (which I presume to be the inherited character) than in the case 
of man. Mares go at different ages to the stud, they remain — -for reasons not 
stated — uncovered for occasional years, or periods of years ; they return to the 
training stable for a time; they are sold abroad; they are converted into hunters, 
put into harness, or, as is occasionally recorded, sold to cab proprietors. This by no 
means invariably denotes that their fertility is exhausted ; their offspring may be 
bad racers, or their stock unfashionable. Very frequently also Ave find the mare put 
to a cart-horse stallion, for a year, a few years, or for the remainder of her career, 
and then no record at all is given of the result. Thus the total fertility recorded 
can have but small correlation with the potential fertility, and I was compelled to 
deal with fecundity. The insufficiency of the apparent fertilities, as recorded in my 
mare index, to solve the problem, may be illustrated in the following manner : 
1100 cases of the apparent fertilities of mares and dams having had at least four 
coverings were tabulated (Table IX.). The following results were calculated from 
this table, the subscript m referring to mare and d to dam :— 

M d = 7*6655, M w = 6-1391, 

<r d = 3*3652, o- m = 3*1617, 

r md = - -0868. 

The probable error of r md = *0202, and thus we might argue that a fertile dam 
has, on the average, infertile offspring. But an examination of the above numbers 
shows us that the dams are more variable than the mares,t and yet the dams have 
been theoretically subjected to the greater selection, for they must all be granddams, 
or the fertilitv of the mares could not have been recorded. We are forced to 
conclude that the mares have been in some manner selected, and the form of the 
selection is fairly obvious on examining the table. There appears a great defect of 

* Even the pedigree of such a famous racehorse as Grladiateur is soon cheeked by the occurrence of 
alternative sires. His sire, Monarque, was the son of either The Baron, or Sting, or the Emperor. 
f The variability of mares, as a whole, not separated into mares and dams, is (see Art. 16) 3*2775. 

2 p 2 
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mares in the third quadrant/"' i.e., of mares and dams of large fertility, the frequency 
is cut off abruptly in this quadrant. The reason for this is fairly clear. We have 
dealt with a limited number of years, about 30, of horse-breeding ; hence, when the 
dam has a long record, her later offspring at any rate cannot possibly have a long 
one ; when she has a short one, it is possible for them to have a long one. Accord- 
ingly, there has been a process of unconscious selection, which has led to a negative 
correlation of the apparent fertilities. 

To illustrate the point further, two more correlation coefficients were obtained. In 
Table X. are given the apparent fertilities of mares and their dams with a minimum 
of eight coverings. We find : 



L b 



M € < = 8*6191, M, a = 7-6309, 

or. = 3*1656, o-,, = 2-8149, 



' niil ™™** "™" UO# U. 



mil 



The probable error is again about "0202. While the mares now form a group with 
their mean fertility almost equal to that of the dams in the previous result, their 
variability is markedly less. Relatively to the dams its reduction is even greater. 
The correlation is sensibly the same. It would thus seem that the anomalous 
selection of mares which thus reduces their variability so markedly below that of 
the dams is not in the low fertilities. 

I now removed from the Table IX. all parts of it concerning mares with a fertility 
greater than 8 ; 867 mares and dams remained with a minimum limit of four 
coverings, the mares not having a greater fertility than 8 offspring, I found : 

M d = 77636, M w = 4*8558, 

<r d = 3*3983, or m = 1-9887, 

r = — •019(h 

The probable error of r = *0229. Now the line of regression for dams on mares 
ought to be the same, whether we obtain it from this result or from the first results 
in which mares with more than 8 offspring are included. Yet, in this case, there is 
no sensible correlation at all. In other words, if we exclude the data for large 
fertilities, we should have to conclude that there was no correlation between the 
apparent fertilities recorded for mares and their dams. We are thus forced to conclude 
that apparent fertility is a character depending on the manner in which the record 
is formed, and must be. useless for the investigation of inheritance. This investi- 
gation strengthens my a priori reasons for selecting fecundity, not apparent fertility, 
as the character to be investigated. I took the fecundity of a brood-mare to be the 
number of her living offspring divided by the potential number of her offspring 
under the given circumstances. Of both numerator and denominator of this ratio 
I must say a few words. 

* The portion of the table cut oil by vertical and horizontal lines through the means of dams and mares, 
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In considering the inheritance of fertility I had two different problems in my 
mind : (i.) Is fertility pure and simple inherited ? i.e., Does a very fertile mare have 
offspring more fertile than the average? And (ii.) What effect does reproductive 
selection actually have on the population ? i.e., To what extent is it screened by 
other factors of evolution ; does the very fertile mare actually have more offspring 
than the less fertile ? Is, for example, her stock weedy and likely to die early ? In 
the case of mankind, the fertility of a woman is, as a rule, effectively brought to 
its limit with the end of her marriage, and accordingly I started with completed 
marriages. In the case of a brood-mare her effective fertility depends not on the 
offspring she has but on the number of these which survive foaldom. It would doubt- 
less have been better to have treated these two problems of fertility separately, but 
being fairly confident from Proposition I., p. 260, that fertility must be inherited, I was 
more interested to test the actual effect of reproductive selection. Accordingly I 
selected as the numerator of my fecundity ratio, not the number of foals born, but 
those who survived to the yearling sales. The difference is not very great, but quite 
sensible. For example, the mean fecundity of 3909 brood-mares, measured in my 
way, = '6343, i.e., 63 surviving offspring on the average of 100 coverings. 

The following table gives the result of reckoning merely barren mares and those 
slipping foals or giving birth to dead foals in a twenty-year period :— 

Average Fecundity of Brood-mares. 



Year. 
1873 


Average fecundity. 


Year. 


Average fecundity. 


•712 


i 

1883 


•693 


1874 


•703 


1884 


•678 


1875 


•707 


i 1885 


•702 


1876 


•697 


1886 


•700 


1877 


•692 


1887 


•682 


1878 


•680 


1888 


•695 


1879 


•683 ! 


| 1889 


•685 


1880 


'666 


1890 


•686 


188 L 


•680 


1891 


•679 


1882 


•667 

j 


1892 


•675 






i 





The averages of five-year periods are ; 



•675, 



'691, 



•684, 



•702, 

and of the whole period, *688. 

There does not appear to be sufficient evidence for any secular change here, and 
we may take '688 to represent the average fecundity of the brood-mare, reckoning 
viable offspring to the number of coverings. The difference of '688 and *634 gives 
a death-rate of 5*4 foals in 68*8, or a death-rate of 7*85 per cent, of foals before 
maturity. If a considerable part of this death-rate be differential, we have room 
for natural selection influencing the drift of reproductive selection. The standard 
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deviation in the fecundity is, however, about '191, or about 19 foals in the 63, or 
about 30 per cent. — a very great variation, so that if fecundity be inherited, a 
differential death-rate of the immature will hardly suffice to check it. 

So much then of the numerator of my ratio. I have spoken immediately above of 
the denominator as if it were the number of times the mare had been covered. 
It is generally this, but in the relatively few cases where the mare has given birth 
to twins, I have counted that covering' twice. Had this not been done the fecunditv 
might have been greater than unity, for example even in some exceptional cases 
have risen to two. On the other hand, a loss of twins would have been marked by 
no greater change in fecundity than a loss of one foal, or the survival of one twin 
w^ould not have been different in its effect on fecundity to the birth of a foal. In 
order, therefore, to avoid these difficulties—especially that of isolated individuals 
lying far beyond the fecundity range of to 1— when twins were born the poten- 
tiality of the covering was reckoned in the denominator as two. The relative 
infrequency of twins causes, however, this modification of the denominator to have 
small influence on the result. 

My next step was to form some estimate of the extent to which fecundity thus 
measured was the same for different periods in a mare's breeding career. I expected 
fecundity to diminish with age as in the case of mankind, but taking out a fairly 
large test number of mares, I found that their fecundity for the periods covered by 
two successive stud-books was in the majority of cases closely the same. With larger 
experience I should now lay more weight on the decrease of fecundity with age ; and 
I also think fecundity is smaller when the mare first goes to the stud. But even 
thus much of the reduced fecundity of old mares seems to arise from breeders sending 
famous mares to the sire long after their breeding days are passed. I have several 
records of old mares being covered seven or eight times without offspring. This 
custom of breeders was much more rife in the early days of breeding than it appears 
to be now, when some breeders discard or sell a fairly old mare, even if she is barren 
two or three successive years. Clearly the custom gives the mare a fictitious 
fecundity, far below her real value, and probably accounts for granddams having a 
somewhat less fecundity than their granddaughters. 

The next problem to be answered was the effect the method of forming my 
fecundity ratios might have on the relative numbers which would be found in 
different element-groups. For example, supposing the element of fecundity to be 
1/10, or the element-groups 0-1/20, 1/20-3/20, 3/20-5/20, . . . 17/20-19/20, 19/20-1, 
would the fact that the fecundity ratio is a ratio of whole numbers cause, & prion, a 
greater probability of frequency in one of these element-groups than another ? 

To begin with, all estimation of fecundity based on less than four coverings was 
discarded. Three coverings give too rough an appreciation of a mare's fecundity, it 
can only fall into one of the values 0, 1/3, 2/3, and 1. The question then arises, if 
all the fecundities : 
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0/4, 1/4, 2/4, 3/4, 4/4, 

0/5, 1/5, 2/5, 3/5, 4/5, 5/5, 

0/6, 1/6, 2/6, 3/6, 4/6, 5/6, 6/6, 



• • • 



0/26, 1/26, 2/26, .... 25/26, 26/26, 

were equally likely, how would the frequency depend on the grouping ?* 

Taking 26 coverings as the probable maximum — it actually occurs — we have for 
the total number of fecundities given above : 5 + 6 + 7 + -««+27 = 368 separate 
fecundities. Let us see how they would be divided in one or two cases. 

Case (i.) Let the elements be based on 1/8, or be 0-1/16, 1/16-3/16, 3/16-5/16, 
5/16-7/16, 7/16-9/16, 9/16-11/16, 11/16-13/16, 13/16-15/16, 15/16-1. 

The half-groups at the ends are taken so that zero and perfect fecundity should 
really be plotted at the middle of a 1/8 element. We find, adding up the numbers 
of the above fecundities which fall into the nine groups, the following frequencies : — 

33*5, 42, 43-5, 44, 42, 44, 43*5, 42, 33'5. 

There is thus a somewhat deficient frequency in the terminal groups, and this 
would probably to some extent bias the distribution. 
Case (ii.) Let the elements be based on 1/15, or be 

0-1/30, 1/30-3/30, 3/30-5/30, . . . 25/30-27/30, 27/30-29/30, 29/30-1. 
We have the following distribution : 

zjo, 2i'2i, Zo 9 Jio'u 9 Js2i'u 9 Ito'uy Zo'U) ZtZy ZZ 9 Zio'D^ Zo'O, ZZ'O, Zo O, <Zo, ZZ, Zo. 

The bias here is only slight and the distribution is on the whole very satisfactory. 
Case (hi.) Let the elements be based on 1/20, or be 

0-1/40, 1/40-3/40, 3/40-5/40, . . . 35/40-37/40, 37/40-39/40, 39/40-1. 

We find for the groups : 

23, 13, 17*5, 17*5, 17, 18, 17, 18, 17'5, 17, 18, 17, 17'5, 18, 17, 18, 17, 17*5, 17*5, 13, 23. 

Here the terminal groups have too great a frequency, and the adjacent groups too 
little. It is clear that the division into 1/15 elements is better than those of 1/8 of 
1/20, so far as these results go. But unfortunately the different coverings do not 
occur in anything like the same proportions. Their exact frequencies could only be 
found a posteriori, and I was desirous of having some idea of grouping before start- 

* Such problems are really not; infrequent in statistical investigations, and seem to be of some 
interest for the theory of fractional numbers. Mr. Filon worked out for me the details of the cases 
given below. 



PROFESSOR KARL PEARSON AND MR. LESLIE BRAMLEY -MOORE, 

ing the labour of extraction. I therefore weighted the different coverings on the 
basis of a small preliminary investigation as follows : 

Case (iv.) Number of coverings, 4 to 5 inclusive, loaded with 2. 

6 to 9 ,, ,, 3. 



„ 10 to 15 „ „ 4. 



19 to 26 „ ,. 1. 

The resulting system of frequencies was : 
54, 42-5, 45'5, 47, 45*5, 46*5, 46, 45, 45, 46, 46*5, 45*5, 47, 45-5, 42*5, 54, 

This system is not so uniform as in Case (ii.). I had hoped that the 744 frequencies 
would have been fairly closely the double system of Case (ii.), The main irregularity 
occurs at the terminal groups, or those having fecundities nearly zero and nearly 
perfect. These I considered would be relatively infrequent, when we started with as 
many as four coverings, and had an average failure of about 37 in 100. The sequel 
showed that the assumption was legitimate, so far as regards zero fecundity, but that 
perfect fecundity was sufficiently frequent to cause a hump in the frequency curve 
for fecundity, corresponding to the group-element 29/30 to 1. The frequency of 
this group is greater than that of the group 27/30 to 29/30, when we start from at 
least four coverings. This hump entirely disappears, however, if we start with at 
least eight coverings. Thus I take the hump to be purely '■ spurious/' i, e., a result of 
the arithmetical processes employed, and not an organic character in fecundity. It 
depends upon our definition of fecundity, which is not a truly continuous quantity. 

As the theory of correlation applied is not in any way dependent on the form of 
the correlation surface, beyond the assumption of nearly linear regression, the hump 
cannot, I think, sensibly affect our conclusions. Had I known, however, a priori, 
what the frequency of different coverings and the nature of the fecundity frequency 
curve would be, I should have attempted to choose such a group-element, that, with 
proper weighting of the coverings, there would have been no arithmetical bias to 
the terminal groups. As it was, it seemed to me that the group-element of 1/15 
gave fairly little arithmetical bias — at any rate where the bulk of the frequency would 
occur— and it was accordingly adopted as a basis for classifying fecundities. 

The difficulty illustrates the point I have referred to, namely, that in statistical 
investigations the best classification can only be found d posteriori, but the classifi- 
cation adopted has usually to be selected d priori 

The 1/15 element being selected, the letters a, b, c, d, e } f, g, h, i,j, k, 1, m, n,p, q 
were given to the 16 groups of fecundities from to 1, as cited under Case (ii.)." 

* A table was formed of the 368 actually-occurring fecundities, from which, it was possible to at once 
read off the group (or it might be two groups, e.g., \5 falls half into h and half into i) into which they 
each fell. 
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Thus the fecundity of a mare was described by one of these 16 letters. Here the 
centre of the j group, for example, is # 6, and it covers all fecundities from '56 to '63, 
Thus midway between j" and h we are at about the mean fecundity. 

The more recent Stud-Books, vols. 12 to 17, were taken as containing more 
complete details and, what is more important, less in-and-in breeding, although as we 
shall see, this is still an important factor. These volumes cover 30 and more years 
of English^ stud life. From these 30 years' records upwards of 5000 mares, who had 
been covered upwards of four times, had their fecundity ascertained. The process 
was a very laborious one, as each mare had generally to be sought for in several 
volumes, and the records in each volume are not continuous, but overlap by quite 
arbitrary numbers of years. Further, great care had to be taken 'to identify each 
mare properly, as the same name is very frequently repeated, and the like difficulty 
occurs, though to a lesser extent, in the case of sires. A card was then written, 
giving the name of the mare and those of her sire, her dam, and her dam's sire. 
Upon this card the letter indicating her fecundity was placed. A card alphabet of 
mares was thus formed, consisting, in the first place, of about 3000 entries. This 
alphabet was again gone through and the fecundity of the dams of the mares inserted 
on the cards till there were about 2500 cases known of mare and dam. The dams 
were partly found from the existing series, but it was also largely necessary to work 
out fresh cases. Lastly, the cards were gone through and the fecundity of the grand- 
dams entered in upwards of 1000 cases. This forms the first series of cards. 

In the next place a card index was formed of. all the sires serving during these 
30 years. This contained upwards of 1000 cards. On these cards the sire's sire was 
entered, and the fecundity of all the mares contained in the first or mare alphabet 
was now taken off and placed on the card of the mare's sire. Thus the card of each 
sire had the letters a, b, c, d, e, &c, upon it, and a frequency distribution was formed 
on the card of each sire for the fecundity of his daughters. 

The same thing was done for the sires' sires ; only here recourse had again to be 
had to the stud-books to obtain the fecundity of the daughters of the more ancient 
sires. Finally, a sire-alphabet was obtained which gave the average fertility of the 
daughters of a sire and of the daughters of his sire, or his half-sisters. On these 
cards was also placed the number of mares upon which each average was based. 

These two card-alphabets, the mare and sire alphabets, form the " dressed " 
material upon which all the subsequent calculations were based. 

(14.) At this point it seems desirable to insist somewhat on the many causes which 
tend to make the fecundity of mares, as thus determined, to a considerable extent 
fictitious. Many of these were only apparent to me as I became more and more 
familiar with the material 

* Irish mares were excluded except where, for pedigree purposes, it was necessary to deal with them. 
Many Irish mares were further included when it came to the valuation of the fertility of mares due to 
a given sire. 
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(a.) Mares appear to be less fecund at the beginning and end of their breeding 
career. Hence, when the fecundity is based on a part only of their career, as it often 
must be, we do not really get a fair appreciation 

(Z>.) A more fertile mare is likely to have more daughters go to the stud than a less 
fertile one, and hence we get a better appreciation of the fertility of the offspring of 
the former than of the latter. 

(c.) Fashion among breeders interferes largely with the exhibition of the natural 
fecundity of a mare. She may be a famous mare and is sent to a famous sire, even 
though produce is not so likely as if she were put to a sire of a different class. This 
appears to be practically recognised when apparently barren mares are sent in one 
season to two, or even three sires, or again to half-bred horses or cart-horses. 

(d.) Brood-mares which have produced performers are kept much longer at the 
stud, and we have the fecundity lowered by coverings after the mare is sensibly 
sterile. Less important mares are removed sooner from the stud. 

(e.) Good racing mares are often put late to the stud. 

(f.) In a certain number of cases we are simply told that the mare had no produce 
for a period of years, but whether she was covered or not is unrecorded. 

(g.) Second-rate mares, or mares thought to be near the end of their fecundity* are 
often sold abroad. In the latter Qase the fecundity is fictitiously increased ; in the 
former we have only a short period to base it om 

(Ji.) There is no record kept of the half-bred foals, which for our purpose are as 
important as the thoroughbred foals. "Put to a hunter" is a not uncommon record^ 
with no statement of the result. 

(i.) Comparatively infertile mares, unless of very valuable stock or famous as racers, 
are not kept long enough at the stud to get a reliable measure of their fecundity. 

(/.) The smaller breeders will often put mares to inferior sires, already nearly 
worn out, either because they own them, or because their fee is low ; and thus again 
a full chance is not given to the fecundity of the mare to exhibit itself. 

(k) We have excluded in our determination of the fecundity foals dying young. 
This is often due to the fault of the mare, but is often again due to the environment. 

(/.) Lastly, thoroughbred mares are highly artificial creatures, and many must 
suffer from their environment,* either in the matter of barrenness or slipping foal, in 
a manner from which the wild horse or a more robust domesticated animal would be 
entirely free. 

These considerations may suffice to show that our values of the fecundity will only 
roughly represent what may be termed the natural fecundity, and we ought not for 

* I am told that there are like difficulties with cows. Cows are very liable to slip their calves, and 
one cow doing so, several others in the herd will or may follow her example. There is a strong folk- 
belief in Wiltshire — I give it merely as evidence of what a slight change in the environment is supposed 
to achieve — that the habitual presence of a donkey with the herd in some way soothes the cows, and 
renders them, less ready to slip their calves. 
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a moment to expect inheritance in the full intensity of the Galtonian law to be 
exhibited by such material. 

(15.) But there is another point of very considerable importance for the weakening 
of correlation, namely the effect of in-and-in breeding. To get correlation we must 
have a diversity of parents producing a diversity of offspring, but when the parents 
become more and more identical, we get larger and larger arrays between which and 
the parents the correlation is weakened. For example : suppose the correlation found 
between all parents and offspring in the general population, and now select only all 
the brothers in a large array and find the correlation between them and their 
offspring, we shall find that the correlation is lower than in the previous case.* It 
would be impossible to apply theory to the present case, however, because we can 
only roughly appreciate the extent of such in-and-in breeding. That it is great the 
following statistics will show. 

Of the more than 1000 sires in my sire alphabet, only 760 were sires of mares 
which had been covered at least four times. These 760 sires had upwards of 
5000 offspring, of whom I had the fecundity recorded, but when mares with 
alternative sires were excluded, there remained only 4677 available mares.t These 
mares were distributed as follows : — 



Daughters . . , . 


1 

280 


2 


3 

78 

15 

8 


4 
43 

16 
6 


5 


6 


7 
20 

19 

4 


8 
21 

20 
4 


9 
22 


10 
14 


10 


12 

10 


Oli vb • » * « t 


113 


29 

17 

2 


22 

18 
6 


Daughters ♦ . . . 


13 
11 


14 


Above 20 


Sires ..... 


11 


46 



Here the second line gives the number of sires having the number of daughters 
in the first line in the 4677 cases, which I take to be a fair sample. 

Thus over a third of the sires had only one mare. Two-thirds of the sires had 
together only one-fifth of the mares. Seventy-six of the sires were fathers of 
about half the mares, and 46 sires alone produced 1801 mares, almost as many as 
642 sires did. We are here dealing with the fairly long period of 30 years, but 
even making due allowance for young stallions commencing and old stallions con- 
cluding their stud career, it will be manifest that our sample shows that the great 
bulk of mares for the period in question were the offspring of comparatively few sires. 

But let us look at the problem from the standpoint of the sires. My 760 sires 

* The theory of such cases is fully developed in a memoir on the influence of selection on correlation 
not yet published. 

t Some other cases were also excluded for diverse reasons. 

Zi Q & 
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were all fathered among themselves except In 49 cases. In other words, they were 
the sons or grandsons of only 49 sires. Of these 49 sires, there were 12 whose 
pedigree I could not trace, # but they were very probably sons of sires already on my 
list or among the remaining 37. In the majority of cases they appeared only as 
the sire of one stallion. The remaining 37, whose pedigree I could trace ? were 
descended at once or in very few generations from 9 sires.t Thus both from the 
standpoint of the mares and of the sires we are dealing with a closely In-bred 
stock, and this is one and probably a very important factor in the weakening of the 
fecundity correlation. 

Having regard to these difficulties, if we can succeed in showing that fecundity in 
thoroughbred racehorses is inherited, we can be fairly confident that we have 
only reached a lower limit of the correlation coefficient* 

(16.) On the Inheritance of Fecundity in the Female Line, 

(i.) A preliminary investigation must here be made, in order to determine the p of 
the foimulse given in Proposition III. (p. 269) we want the correlation of fecundity 
with fertility. If cf) be the fecundity, f the apparent fertility, and c the number of 
coverings, twins counting as a double covering, we have : 

<£ = f/c, 
whence if we determine the correlation between <£ and J] numerous constants will 
follow. Table XI. gives the correlation between fertility and fecundity for 1000 
brood-mares. We found : 

M^= '6375, Kr= 6*515, 

cr^ = *1810, o-f = 3*2775, 
p = Ttf— '5152, 

^ = 100o*/M* = 28*39, v f = 100cr f /M. f = 50*31, 

where v^ and v f are the " coefficients of variation."! Here by Yule's Theorem § 
r^o-JcTf is the slope of the line which most closely fits the curve of regression for 
fecundity on fertility. If we supposed this curve to be straight, then the line must 
coincide with It. Now since fecundity vanishes with fertility, the curve passes 
through the origin, and hence, if the regression be linear, the line must also pass 
through the origin. In this case, as is shown on p. 270, r ¥ = v 4 Jv = '5644. The 
difference between *5644 and '5152 may be taken, as it Is several times the probable 
error, to indicate that the regression curve between fecundity and fertility is only 
approximately linear. 

The variations in both fertility and fecundity are here large. Accordingly we 

* Stockmar, Sovereign, Andover, Phaeton, Prince CarUdoc, Robert Houdin, Pylades^ King of Kent, 
Garry Owen, Calaban, Homily and Taurus. 

f Tramp, Sir Peter Teazle, Catton, Buzzard, Oville, Diouied* Sorcerer, Dr. Syntax, Marske* 
J 4 Phil. Trans.,' A, vol. 187, p» 2 76. 
§ 4 Roy. Soc* Proo.,' vol. 60, p* 477* 
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must use the formula (i.) for the mean value of an index given in my memoir on 
spurious correlation. 4 " We shall then obtain an approximate value to the mean 
number of coverings of each mare. Formulae (iii.) of the same paper will then give 
the standard deviation for the number of coverings. In our present notation : 

c = M, 
and therefore : 



M. 



M 
M. 






1 + 



100 



r 



M 



10,000/y 



ov 



M, 



V (los) 



+ 



It 

100 



We find 



and : 



\100; 

Mj/M* = 10-2196, 

M c = 10*2196 X 1-007 
cr c = 4*4455, 
u. = 43*20. 



— 9r y& 
J r * 10,000 



10-2911, 



To the same degree of approximation we can further ascertain the correlations 
between the number of coverings and the apparent fertility and fecundity, i.e., r cf , 
and r c ^ A short investigation similar to those in the memoir on spurious correlation 
just cited shows us that : 

. ** = ( w - %)K 

These lead to the numerical results : 

r c/ = -8259, r# = — '0572. 

The conclusions to be drawn from these results are all of some interest. In the first 
place we may ask : How does M^ agree with its value found from other and more 
complete series ? For 4677 mares— my complete series without mares with alternative 
sires— the average fecundity was '6373. A better agreement could not have been 
hoped for. In a group of 1509 mares dealt with for variation only and entered as 
S{ daughters" on the cards—so that they had not been selected by the fact that their 
daughters must have recorded offspring, as is the case with " dam ?J entries — I found 
the following results ; — 

Variation in Fecundity of 1509 Brood-maizes (Four Coverings). 



Fecundity. 


a. 
9 


b. 
3 


11 


a. 

26 


e. 
46 


/• 


85 


h. 


i. 
154-5 


y« 


194 


I 


m. 
146 


100 


p. 
23 


2- 
90 


Frequency 


43-5 


1225 


232-5 


223 






Total 1509, 


M^ = -6345. *+ = 1965. 











* ' Roy* Soc, Proc.,' vol. 60, p. 492, 
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Now this is precisely what we might expect ; the mares belonged to a class, of 
which we are not certain whether their daughters have or have not recorded fecundity. 
The mean fecundity is therefore decreased and the variability increased. Add to this 
group 2400 mares, all of which had had their daughters' fertility recorded, and we 
find for 3909 mares, M^ = '6345 and cr^ = "1910, i.e., the mean fecundity ascends 
and the variability falls. Illustration of this law will be found in the following two 
groups :— 



1200 mares ...... 

1200 dams 


'6337 
•6525 


H- 


•1888 
•1643 



Thus we send up the mean fertility and lower the variability by separating into two 
groups the pedigree of one which has a longer record. This is precisely in accordance 
with the theory already developed. Our mean fecundity and variability for brood- 
mares may be considered as constant characters, and variations in their values beyond 
their probable errors due to conscious or unconscious selection in the record itself, 
or in our extracting from it, 

The reader will notice at once, if he turns to the diagram of the above frequency, 
(i.) that there is a small hump at (a) -of no practical importance, and a larger one 
at (q), perfect fertility being fairly frequent with only four coverings, and there being 
from the arithmetical processes involved a bias towards (q) as compared with (p). 
(ii.) The distribution of frequency, although somewhat ragged, is quite clearly not 
normal, but of the character which in other papers I have called shew. Were there 
any occasion, it w r ould be easy to fit it with one of my skew curves. To mark 
how (i.) will disappear and (ii.) become still more apparent, I have placed on the 
diagram the frequency distribution for 2000 mares reduced to the same scale e 




t& & it 

Fecund iCy. 
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Variation in Fecundity of 2000 Brood-mares (Eight Coverings), 



Fecundity 


a. 




b, 

2 


c. 

7-5 


d. 


6. 

21-5 


55 


</• 


&. 


i. 


• 


A?. 


Z. 


m. 
204 


». 


49 


2- 
19 


Frequency 


11-5 


104-5 


182 


271-5 


315 


337 


293-5 


127 




Total, 2000. 


M* = -6330. H = -1568. 







Thus, making the minimum number of coverings 8 instead of 4, has removed 
the terminal humps, zero fecundity is now unknown, and perfect fecundity very rare. 
We have reached a smooth skew frequency distribution ; we see fecundity as a con- 
tinuous character obeying the usual laws of variation.* The mean fecundity in the 
two cases is sensibly the same, *633, but owing to the fact that we have made a 
selection of a limited group in the second case, the variability is considerably 
decreased. 

The average apparent fertility of brood-mares, 6*515, must not be confused with 
their average real fertility, for, as we have seen, we have in many cases not a com- 
plete record of their stud-life, or such a full record has not been, used (e.g., in case of 
mares still at the stud, but having been already covered four or more times). Its 50 
per cent, variation shows that an apparent fertility of 9 to 12 is not infrequent. 
The average number of coverings being 10 and more, it will be seen that the records 
of between 50,000 and 60,000 coverings have been dealt with to form our mare and 
sire alphabets. The large variability in the number of coverings shows that 15 
to 20 coverings will not be infrequent, and cases of 26 actually occurred. Lastly, 
we have the correlation between fertility and the number of coverings, high as 
might be supposed, for a high apparent fertility could only be exhibited by many 
coverings. Although a low apparent fertility might correspond to any number of 
coverings, still, in practice a sterile mare will not be sent indefinitely to the sire. The 
correlation between the number of coverings and the fecundity is small and negative 
( — '0572). This follows from the principle that, fertility being the same, a high 
number of coverings reduces the fecundity, and this factor is more potent than the 
high correlation of fertility and the number of coverings. 

(ii.) Table XII. exhibits the correlation of 1200 mares and their dams with regard 
to fecundity. Here the more fertile dams are weighted with their fertility, and at 
least four coverings were required of each mare. If the subscript m refers to mare, 
and d to dam, we find : 

# The actual equation to the curve referred to the mode '6531 as origin, the axis of x being positive 
towards perfect fecundity, and the unit of x being 1/15 is : 

y = 342-187 (1 + a?/47-1358) 83 ' 6261 (1 - aj/12'1106) 31 '* 291 . 

The fit will be found to be very satisfactory, 
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M ??i = '6337, Mei = '6525, 

<r w = '1888, <r d = 1643, 

r md = -0831. 

The coefficient of regression = *0945, 

The probable error of the correlation is '0193 and of the regression"* '0195. Thus 
these quantities are four to five times their probable errors, and we conclude that 
fecundity is certainly inherited. 

The intensity is far below that suggested by the law of ancestral heredity, but it 
nevertheless exists. Its lowness is probably due to the fictitious character of the 
fecundity owing to the causes indicated on pp. 298-9. An attempt must now be made 
to eliminate some of the factors disguising the fecundity, but to do so is by no means 
so easy as in the case of fertility in man. 

(iii.) My first idea was that by taking a higher limit to the number of coverings a 
closer approach might be obtained to the true, i.e., the inherited fecundity. 
Accordingly Table XIII. was formed for the correlation of 1000 mares and their 
dams, when the minimum number of coverings was eight. But I did not recognise 
that this would give far greater weight in the Table to the older mares, and that 
accordingly causes (d) and (i) of p. 298 would now play a much larger part in disguising 
the true fecundity than before. There appears to be no limit to the number of 
times a famous old mare may go to the stallion when there is very small hope of any 
offspring. 

Table XIII. gives us the following results : 

M w = -6300, M d = -6360, 

cr m = 1633, a d ' = '1500, 

The coefficient of regression = '0708. 

The probable error of the correlation is '0212, and of the regression '0213, both 
less than a third of the observed values. We should again conclude from this result 
that fecundity is inherited, although it offers less strong evidence than the previous 
case. The influence of selection! is at once apparent in the great reduction of the 
variabilities. The fact that we are throwing the determination of fecundity more 
on to the old age period of life appears from the reduced mean fecundities. I 
attribute the reduction in the fecundity-correlation to this source, i.e., the very diverse 
treatment which old mares receive at the hands of different breeders. 

(iv.) I made another attempt to remove screening causes by taking 1200 more 

* Pearson and Eilon : 4 Phil. Trans.,' A, vol. 191, p. 214. 

f The effect of such a selection as the above in reducing correlation is dealt with in my paper on the 
influence of selection on correlation? 
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mares, not identical with the series in (ii.)'* and working out their dams' records most 

carefully, rejecting any cases in which the breeder was clearly sending the mare to 

the stallion long after it was obvious (post facto) that she was sterile. In this case 

four coverings were retained as a minimum, and the results are given in Table XIV. 

We find ; 

M w = -6369, M d = -6616, 

cr m = -1885, <r d = '1604, 

V*md :==1 "Oy JD. 

The coefficient of regression = # 1169. 

The probable error of the correlation is '0193, and of the regression '0194 ; the 
correlation is accordingly more than five, and the regression more than six times its 
probable error. We conclude that fecundity is most certainly inherited. The 
regression found is, however, only about two-fifths of what is required by the law of 
ancestral heredity, 

(v.) It has been suggested that fertility or fecundity might alternate in two 
generations ; when the offspring are numerous their offspring might have less fertile 
or fecund offspring. I do not see how this would be possible without its exercising 
an influence on the correlation of two generations, for we must come to one fertile 
followed by an infertile generation. But I had made preparations in my alphabet of 
mares for testing the correlation between mares and their granddams, and I went on 
to the construction of a table, although the results for mares and their dams showed 
me that whatever result might be reached, it would be within the probable error of 
the observations. I reached this conclusion in the following manner : If we go back 
one generation we introduce, owing to the nature of the record, so much fictitious 
correlation and so much in-and-in breeding that the coefficient of inheritance is 
reduced to two-fifths or less of what its value should be according to the law of 
ancestral heredity. In going back two generations we come to fewer mares, to more 
in-and-in breeding, and to just the type of famous old mare, whose breeder kept her 
at the stud long after she was sterile. I expected accordingly a great and artificial 
fall in the fecundity of granddams and a double drop, something like f X f , in the 
value of the regression as indicated by the law of ancestral heredity. This would 
reduce the apparent regression to about f X f of '15, or to about '025, say, a value 
about equal to the probable error of the table. The results actually reached are given 
in Table XV., and we find, if the subscript g refer to granddam : 



VI = 


•6345, 


M„ = 


: -6232, 




•2040, 


°V = 


: -1687, 




' rug 


= -0169. 





The coefficient of regression = '0204. 

* In the first series the mares' names ran from A to G; in the second from G to M, with 300 
additions made to the A to G series, while I was completing my alphabet. 
VOL. CXCII. — A, 2 R 
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The probable error of the correlation is '0218, and of the regression '0213. Thus 
these results are not significant in themselves, but they are exactly what we might 
expect on the above hypothesis. Taken with the other five tables which we have 
worked out for the inheritance of fecundity , they are significant, for every one of 
them gives a positive correlation, however small it be, and thus adds to the accu- 
mulated evidence that fecundity is a heritable character. 

(vi.) It remains to test our results by the theory developed on pp. 269 et. seq. But 
a difficulty comes in here. Turning to (xviii.) and (xix.) on p. 268, we cannot feel 
justified in putting M x = M 2 , for there is a secular difference in the fecundity of mares 
and dams, owing to the fecundity of the older brood-mares being based on a longer 
period and liable to the disturbing causes so markedly manifest in the correlation of 
mares and granddams (see my remarks, p, 305). If we combine (xviii.) and (xix. 
we find 



M" 2 - M' 2 = r ^ (W\ - M',). 



°i 



Now r is small, and it will accordingly be legitimate to put M.\ = M' 2 and cr x = cr 2 
on the right, we have then 



(M" 2 - r'M" 1 )/(l -- r) = 

From this we deduce for the results in (ii.) on p. 804 

M' 2 =-6321. 
Turning now to (xix.), it may be written 



<r? / cr'\\a s - 



The second term in the curled brackets is small, and in it we may put to a first 
approximation <r\ = ar 1 = o- z and M x = M'g. We then have 

I \ V6321/ (-5132) 2 / 

or, 

W\ = M\ X 1*0666. 



Substituting the value of W\ we find 



M', = -6118. 



We thus see a difference in the fecundities of the unweighted dams and unweighted 
mares of *6118 and '6321, or about 2 foals more in the hundred appear to survive in 
the later generation. This is very probably due to the causes already indicated as 
affecting the apparent fecundity of the older mares (see p. 298). The influence of 
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reproductive selection changes these quantities to '6337 in the case of the daughters, 
and to the apparent high fecundity of '6525 in the case of the dams. 

We can now find cr 2 to a second approximation by aid of (xxi.). In the small term 
multiplied by r, we put cr l = <x 2 = cr" 2 . Hence we find 



ai = a"l + r 2 (<r'l - a"?), 



and deduce, on substituting the numerical values, 

cr 2 = -1896, 

or is scarcely different from cr" 2 . We accordingly conclude that we may quite 
reasonably assume the variability of the mares to represent the variability of the 
mares without reproductive selection, but the effect of weighting the dams with 
their fertility is to reduce the variability of the dams from about # 1896, if there be 
no secular change, to an apparent value as low as '1643. 

The same formulae applied to the slightly better results in (iv.) on p. 305 give us : 

M.\ = '6205, and M' 2 = '6342. 

If we pass back from M.\ and M' 2 to M x and M 2 we find : 




If these results be considered as valid, we notice a remarkable difference between 
the fecundity of the younger and elder generation. While the crude results on 
pp. 304 and 305 might lead us on first examination to suppose the elder generation 
more fecund than the younger, these results show us that it is distinctly less so. 
The greater part of the difference, however, is due, not to a secular change, but to 
the causes we have so often referred to as weakening the fecundity recorded for the 
older mares. At the same time the whole system of breeding is so artificial that we 
may well doubt whether our equations (i.) and (v.) can be legitimately applied. For 
the chance of a mare getting into the stud-book as a dam, i.e., having daughters at 
the stud, depends less on her fertility than on the degree of fashion in her stock. 
Thus the record weighting with fertility is hardly a probable hypothesis, and the 
values just given for M x are, I suspect, much below what they should be. For the 
above reason I have not proceeded to consider the changes in variability connoted by 
(ii.) and (xxii.). As I have made no attempt to form a correlation table for mares 
and dams in which the dam would have only one daughter to her record, I cannot 
make any plausible guess at the real magnitude of the cubic summation term in 

Ai JEi M 
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(xxii,). Apart, however, from the numerical application of these variation formulas 
to a somewhat doubtful case, we see in these formulae the theoretical basis for the 
observed fact that the fecundity of mothers is far less variable than that of daughters. 
It is really only an apparent divergence, due to the fact that the mothers have been 
weighted with their fertility ; this, while it increases the apparent mean of their 
fecundity, reduces its apparent variability. 

(17.) On the Inheritance of Fecundity in the Brood-mare through the Male Line. 

For the thoroughbred horse this problem is fairly easily answered b} 7 investigating 
whether mares related to the same stallion have any correlation between their 
fecundities. The two cases I have selected are : (i.) " Sisters," daughters of the same 
sire, but in general not of the same mare ; and then (ii.) "Nieces" and "Aunts," or 
daughters of a sire and the daughters of his sire. As we have only 760 sires and 
nearly 5000 mares, the daughters or aunts fall into rather large arrays, and we are 
compelled to use the methods discussed in Proposition IV., A and B. Even so the 
arithmetical work for a correlation based on the index of sires was far more laborious 
than for one based on the index of mares. 

(i.) To find the Correlation between Half- Sisters, Daughters of the same Sire. 

Here w r e have to use formulae (xxiii.), (xxv.), and (xxvi.) of pp. 272-273. In order 
to do this a table was formed of the mean fecundity M of the array of sisters due to 
each sire, and of \n{n — .1), the number of pairs of sisters in each array. Then the 
products \n (n — l) M and \n (n — 1)M 2 were formed, and the numerator of (xxiii.), 
or cr«, calculated by adding up for all the 760 sires. The result gave : 

arl = '6655167, 

where the unit is the fecundity group element of 1/15. The number of pairs of 
sisters dealt with was 54,305. The denominator a~o(l — p 2 ) + a-% is not so easily 
ascertained. <x is the standard deviation of all the series of mares who are sisters 
without weighting; cr v/(i — j° 2 ) * s the standard deviation of an array of sisters, or 
if the regression be not linear, the mean of such standard deviations for all arrays, 
or rather its square is the mean of the squares of such standard deviations ; p is the 
correlation between a patent character in the daughter and a purely latent character 
in the sire, and cannot therefore be found directly. 

In order to get an appreciation of the standard deviation of an array of sisters- 
it beino* practically impossible to work out these quantities for 760 arrays— I selected 
twenty sires having fairly large arrays of daughters, and reached the following 
results : 
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Table of Arrays of Mares, which are Half-Sisters. 



Sire. 


No. of mares. 


Mean fertility. 


S.D. of array. 


Macaroni 

King Tom 

The Duke . 

Rosicrucian 

Stockwell 

Y. Melbourne 


76 
52 
67 
64 
37 
81 
53 
41 
79 
87 
32 
35 
37 
58 
43 
59 
80 
40 
55 
41 


9-697 

10-750 
9-201 
8-875 
9-708 

10210 
9-689 
9-878 
9-437 
9-057 
9-125 
9-186 
9-297 

10-466 
9-267 

10-932 
9-131 
8-222 
9-064 
9-951 


2-989 
2'545 
3-176 
2-497 
3-076 
2-770 
2-748 
2-086 
3-003 
2-752 
3-314 
2-474 
3021 
2-621 
2-847 
3-094 
2-093 
2-201 
2-301 
1-651 


Totals 

Ditto in actual units* . 


1117 

55-85 

• • 


191-143 
9-55715 
•6371 


53-259 
2-66295 

•1775 



I next took the mean and standard deviation of the 1117 mares to obtain cr . 
The mean fecundity was now found to be 9'5685 and cr = 2*7824, or in actual units 
•6379 and *1855. Clearly only about ■§- per cent, difference is made whether we 
take the mean fecundity of the 1117 mares, or the mean of the unweighted means 

of the twenty arrays. Knowing cr and cr v/ 1 — p 2 we can now find p. We have 

almost at once 

p = -2900. 

This is probably the first determination of a coefficient of inheritance between a 
latent character in one sex and a patent character in the other sex. We see that it 
has almost exactly the value required (*3) by the law of ancestral heredity, or we 
conclude, mares inherit from their sires a fecundity governed closely by the law of 
ancestral heredity. 

If the reader asks why is not the intensity reduced in this case in the same 
manner that we find it reduced in the case of the inheritance from the dam, the 
reply is : 

(i.) In the case of the dam and mare, both quantities to be correlated are liable to 
fictitious values. In the case of sire and mare, we deal with only one. 

* A fecundity unit is taken to bo 1/15, for this is the unit of grouping. 
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(ii.) The influence of fictitious values has been shown on pp. 276-277 to chiefly 
affect the coefficient of correlation and not the standard deviation. 

Now the present result is based solely on the calculation of standard deviations, 
or on the variability of fecundity as a whole and in arrays. It is accordingly not 
influenced to nearly the same extent by the existence of fictitious values. Could we 
calculate the variability of the arrays of daughters due to individual mares, we 
should probably get a better result for inheritance in the female line. # 

The above result is so satisfactory that I have little doubt that we have deter- 
mined a very good value for cr \/l — p\ Substituting it we find for the correlation 

between half-sisters : 

-66552 



7-09130 + '66552 



vDuOt 



The law of ancestral heredity gives for /ia(/~sisters r = 9 2, and f- of this = *08. 

Thus we see that the collateral heredity between half-sisters, daughters of the 
same sire, is quite sensible, and is almost what we might have predicted would be 
the result, if we supposed correlation to be weakened, as in the previous cases, to 
f- of its value by fictitious records. 

It is worth while to consider the amount of fictitious fecundity suggested by 
the reduction factor f. We have only to suppose the n ly /N of our p. 277 to be f. 
Now we may well assume the chance of a fictitious fecundity being recorded to be 
the same for either one of a pair of sisters ; hence we shall have p = q 5 and therefore, 

from the result on p. 276, we find (p — i) 2 /p 2 = -§-. This gives us (p — l)p = \Z'4 f 
and (n x + w 3 )/N the fraction without fictitious values = (p — l)jp = '6325* Thus 
in order to introduce the reduction factor of f by the occurrence of fictitious values 
of the fecundity, we should have to suppose about 37 per cent, of fictitious values to 
occur. This is, of course, a sort of average ; many values will probably be only 
partially fictitious, i.e, P will to some extent approximate to their real values. 
Considering the very artificial character of the thoroughbred brood-mare, and the 
uncertainty of her treatment by breeders, this does not seem such an immense 
percentage that it would force us to the conclusion that the law of ancestral heredity 
cannot be true for the inheritance of fecundity, 

(ii.) To find the Correlation in Fecundity between the Sisters of a Sire and his 
Daughters, 

What we want is really the correlation between aunts and nieces, but they 

* The standard deviations for the arrays of mares in Table XII. were indeed worked out for the 
twelve cases of dams from e to q. The mean of these cases was sensibly the same whether the simple 
mean, or the mean weighted with the numbers in the array was taken, and equalled 2'8091 or T823. This 
is <r (1 — r 2 ) 1/3 . But by p. 48, <r == *1888, whence we deduce r == '1375, and the regression equals *1581. 
Thus we have found a substantially larger value for r than that on p. 301 by dealing with variabilities , 
and not direct correlations. This gives additional evidence, if any were needed, of the inheritance of 
fecundity. 
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are not " aunts " and " nieces " in the human sense, for the aunts are only half- 
sisters of the sire. By a process similar to that on pp. 408 and 409 of my paper 
on the "Law of Ancestral Heredity/'* I deduce that the correlation between a sire's 
sisters and daughters ought to be *05, and not '15 as in the case of Man. If this be 
weakened down to the § of previous results, we should not expect a result differing 
much from '02, As the variability of the elder generation is always less than that 
of the younger, we ought to expect a coefficient of regression of about this value. 
The theory used will be that of p. 273 of the theoretical part of this paper. The 
weighted mean fecundity found for the arrays of aunts and nieces was as follows :— 





Without grouping. 


With grouping. 


Arrays o£ aunts . . , , . 
Arrays of nieces .... 


•6195 
•6346 


•6199 
•6338 



The grouping was done in fecundity units of |-, i.e., 1/30 change in fecundity. The 
agreement may accordingly be considered very good. The " aunts " are the daughters 
of the older sires, who owing to in-and-in breeding form a comparatively small group, 
and are the sires of mares belonging to the older period, whose fecundity is much 
weakened by causes already referred to. Their mean fecundity is slightly less than 
that of granddams, given on p. 305, while the mean fecundity of their nieces agrees 
well with that for the corresponding group of mares. 

The method of grouping being adopted, a correlation table was formed for the 
mean fecundities of arrays of mares, daughters of a sire, and of arrays of mares, 
daughters of his sire. This is Table XVI. Here each mean is weighted with the 
number of pairs of aunts and nieces in the two arrays, i.e., the extent of the data on 
which it is based. It represents accordingly 138,424 pairs of aunts and nieces. 
The following results were obtained, corresponding to 687 pairs of sires : — - 



Sire's Sire. 




Sire. 


M a = -6199. 


M a ,: 


= -6338. 


~(r a = -04344. 


ov : 


= -07609 


B = 


•1174. 





It will be at once noticed how much more variable are the array-means for the 
sire than for the sire's sire. The means of many of the sire's arrays are based upon 
small numbers, which would have been selected out, if we had gone to another 
generation as in the case of the sire's sire. 

It will clearly not be legitimate in this case to put a' a = <r a as suggested on p. 274. 
There is probably no secular change of importance here, but the sire's sire requiring 



* <■ 
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three generations from the record is really more stringently selected than the sire 

with only two. We can now form or and cr by (xxviii.) and (xxix.), if we adopt 

suitable values of cr and cr 7 , p 3 as we have seen, may with high probability be put 
equal to '3 (p. 309). cr for groups of daughters, on p. 309, is given as '1855, but since 
this certainly included a fair number of what are now aunts, it must be somewhat too 

low for cr' . We can well put c/q equal to the *1888 of the mares on p. 304, cr for aunts 
cannot be as low as the standard-deviation of dams on that page, as many of the 
aunts may never appear in the record as granddams, # i.e., they are less stringently 
selected. The mean of the two results for mares and dams may, perhaps, be taken 

as a close enough approximation for our present purpose, or a- Q = '1765. We then 
deduce 



a = -1739. 



cr 



1QKK 



If we compare the results now found with those for sisters cited on pp. 308 and 309, 
we find :— - 





"Aunts." 


" Sisters." 


" Nieces." 


M 

i 


•6199 

•0434 

Ilui/ 


•6371 
•0544 
•1855 


•6338 

♦0761 
-1955 



The accordances and divergences are much what we might expect, except in the 
case of cr a . We should, a priori, have expected " sisters " to have approached 
nieces more nearly than aunts. The work has been gone carefully through, but I 
have not succeeded in finding any error. In the ie nieces," of course, the weighting 
of an outlying fecundity-mean due to a sire with but few daughters, may still be 
large, if his sire have numerous daughters ; this cannot occur in the case of " sisters," 
as the weighting depends only on the number in the array. The like heavy 
weighting cannot usually occur in the case of " aunts," for they are, as a rule (owing 
to selection to the third generation) daughters of old and famous sires, with plenty 
of material for basing averages upon. We do not get many "nieces" attached to 
" aunts," who are not daughters of famous sires. Such is probably the source of 
divergence in cr a between nieces and sisters. 

Using formula (xxviii.), on p. 274, we find 

r' = -0114, 

and for the regression coefficient '0128. 

This value is much below the '05 of the law of ancestral heredity, and below the 
reduced value *02, which we might have expected to reach. Still, it again shows 

* Every dam appears as a granddam, otherwise the fecundity of the daughter could not have been 
found . 
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positive correlation, and we may conclude that the patent character in the daughter 
is inherited latently through the male line. # 

But there is another and far more significant method of looking at this result, 
namely, by considering the meaning of E on p. 274. We may treat the fecundity of 
daughters as really a character of the sire, and their mean fecundity as a measure of 
a latent character in him. R is then the correlation between a latent character in 
both a stallion and his sire, and we see that it is sensibly inherited for R = '1174. 
To compare with the law of ancestral heredity, we must use the coefficient of regres- 
sion, for the stallions are much more variable than their sires. We find 

Regression of stallion on sire = '2056, 

which carries us a long way in the direction indicated by that law. Thus it is 
extremely probable that this law of inheritance applies not only to the inheritance of a 
patent character, or of a character latent in one sex and transmitted to a second, but 
also to the inheritance of a character latent both in the transmitter and receiver. The 
present method accordingly seems applicable to the inheritance of a character latent in 
two individuals, if we take the mean of the character, when patent in the offspring, as a 
measure of its strength in the individual in whom it is latent. If l Y be the measure 
of a latent character in a parent, then the offspring will have a mean value ql x + c x 
of this character, where q is the coefficient of parental regression and c x a constant. 
If l 2 be the measure of the same latent character in a relative, then the offspring in 
this case will have ql 2 + c 2 of the character. But the correlation of l x and l 2 will be 
identical with that of ql x + c x and ql 2 + c 2 , as I have shown elsewhere.t Thus the 
mean of the patent character in the offspring may be used to measure the correlation 
between latent characters in their parents. 

To sum up our results for thoroughbred mares, we conclude that their fecundity, 
notwithstanding the imperfections and difficulties of the record, has been demon- 
strated to be inherited, and this, both through the male and female line, so far as 
we can judge, with an equal intensity. The apparent value of this intensity, except 
in the case of latent characters, is much below that required by the law of ancestral 
heredity, roughly, perhaps, 2/5 of that value; but there is considerable reason 
to think that this reduction may take place owing to the presence of fictitious values 
in the record arising from the peculiar circumstances under which thoroughbred 
horses are reared and bred. These fictitious values would hardly influence the 
means and variability of arrays like they must do the relationship between pairs of 
individuals. Hence, when we deal with such means and variabilities as in the cases 
on pp. 309 and 313, we find a much closer approach to the law of ancestral heredity. 
Fecundity is certainly inherited ; that it is inherited according to the Galtonian law 

* As a matter of fact, this conclusion is stronger than it appears here, for the correlation between 
nieces and aunts was worked out, without grouping, for fourteen distinct series, and in thirteen of them 
was found to be sensibly positive ; in the fourteenth it was found to have an insignificant negative 
value. 

t " On the Reconstruction of the Stature of Prehistoric Kaces," < Phil. Trans.,' A, vol. 192, p. 183. 
VOL. CXCII.- — A. 2 S 
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is not demonstrated, but may be treated as probable until the results of further 
investigations — preferably by breeding experiments instituted for this very purpose — 
are available. 

(18.) Conclusion. — The investigations of this memoir have been to some extent 
obscure and difficult, but the general resuit is beyond question. 

Fertility and fecundity, as shown by investigations on mankind and on the 
thoroughbred horse, are inherited characters. 

The laws of inheritance of these characters are with considerable probability those 
already developed in my memoir on the Law of Ancestral Heredity for the inherit- 
ance of directly measurable organic characters. 

In the course of the work it has been shown how a numerical measure may be 

obtained for the inheritance of a character by one sex from the other, when it is 

t, * 

patent in the former and latent in the latter. Fertility and fecundity purely latent 
in the male (in the sense here used) are shown to be transferred by him from his 
mother to his daughter. Thus Darwin's views with regard to the transmission 
through one sex of a character peculiar to the other are given a quantitative 
corroboration/" 

When we turn from these points to their weight and importance for the theory of 
evolution, we are at once encountered by all the wide-reaching principles which flow 
from the demonstration that genetic (reproductive) selection is a true factor of 
development. Let us look at these a little more closely. 

If natural selection, were to be absolutely suspended, i.e., if there were no 
differential death-rate at all, then development would not for a moment cease. Not 
only is fertility inherited, but there can be small doubt that it is closely correlated 
with all sorts of organic characters ; thus the inheritance of fertility marks, the 
moment natural selection is suspended, a progressive change in a great variety of 
organic characters. Without a differential death-rate the most fertile will form in 
every generation a larger and larger percentage of the whole population. There are 
very few characters which may not be supposed to be more or less directly correlated 
with fertility, and in reproductive selection we see a cause of progressive change 
continuously at work.t There is, so to speak, in every species an innate tendency 
to progressive change, quantitatively measurable by determining the correlation 
coefficients between fertility and organic characters, and. between fertility in the 
parents and in the offspring. This " innate tendency" is no mysterious " force " 
causing evolution to take place in a pre-ordained direction ; it is simply a part of the 
physical organisation of the individual, which does not leave fertility independent of 

* The method is perfectly general, and a value can always be found for the intensity of transmission 
of a sexual character" through the opposite sex. We could obtain, for example, a numerical measure of 
the manner in which a bull transmits good milking qualities to its offspring. 

t I have endeavoured to show ( s Roy. Soc. Proc., 5 vol. 59, p. 303), that fertility is correlated with 
stature in woman, I hope later to return to the correlation of fertility and physique. 
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physique and organic relationship, or leave these characters uncontrolled by the 
principle of heredity. It seems to me, therefore, that the results of this memoir 
force on us some modification of current views of evolution. The suspension of natural 
selection does not denote either the regression of a race to past types, as the 
supporters of panmixia suggest, or the permanence of the existing type, as others 
have believed. It really denotes full play to genetic or reproductive selection, which 
will progressively develop the race in a manner which can be quantitatively predicted 
when once we know the numerical constants which define the characters of a race 
and their relation to racial fertility, In other words, natural selection must not be 
looked upon as moulding an otherwise permanent or stable type ; it is occupied with 
checking, guiding, and otherwise controlling a progressive tendency to change. 

So soon as a species is placed under a novel environment, either artificially or 
naturally, the equilibrium is disturbed, and it will begin to progress in the manner 
indicated by genetic (reproductive) selection, until this progress is checked by the 
development of characters in a manner or to an extent which is inconsistent with 
fitness to survive in the new surroundings. Within a very few generations a novel 
environment, sympathetic so to speak to the progressive tendency indicated by 
reproductive selection, produces the suitable variations without the assistance of 
natural selection. It seems to me that this principle ought to be borne in mind 
when, in laboratory experiments or in artificial breeding, natural selection is wholly 
or largely suspended, or again is altered in type ; the species dealt with is unlikely 
to remain constant for several generations, but will develop in the direction indicated 
by genetic selection. Further, when stable types of life like the English sparrow are 
taken to America, or the English rabbit to Australia, where initially they fill a more or 
less vacant field among living forms, and natural selection is in part suspended, we 
should expect in a few generations a considerable divergence in type.* The converse 
aspect of the problem is also of great importance; namely, the natural selection 
of physical characters must tend to indirectly modify fertility and fecundity, if 
these be correlated with those characters. Variations in the fertility of local races 
need not be looked upon as due directly to environment, but may arise from the 
selection of characters correlated with fertility, combined with the law that fertility 
is itself an inherited character. 

Lastly, the inheritance of fertility involves the "acceleration" of fertility; a 
race, natural selection being suspended, tends not only to increase but to increase 
at an increasing rate. This principle is again full of meaning, not only for the study 
of the manner in which lower types of life rapidly expand under changed environ- 
ment, but also for the problems set to those philosophers who may desire that the 
most social and not the most fertile type of citizen may predominate in our modern 
civilised communities, where the state and public opinion to a greater or less extent 
hinder natural selection from playing the great part it does in wild life. 

* It would be interesting to know whether the size or frequency of the litter of the Australian 
rabbit is greater than that of the English. 
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